wasm-micro-runtime

mirror of https://github.com/bytecodealliance/wasm-micro-runtime.git synced 2025-07-12 23:43:25 +00:00

Author	SHA1	Message	Date
Wenyong Huang	a23fa9f86c	Implement memory64 for classic interpreter (#3266 ) Adding a new cmake flag (cache variable) `WAMR_BUILD_MEMORY64` to enable the memory64 feature, it can only be enabled on the 64-bit platform/target and can only use software boundary check. And when it is enabled, it can support both i32 and i64 linear memory types. The main modifications are: - wasm loader & mini-loader: loading and bytecode validating process - wasm runtime: memory instantiating process - classic-interpreter: wasm code executing process - Support memory64 memory in related runtime APIs - Modify main function type check when it's memory64 wasm file - Modify `wasm_runtime_invoke_native` and `wasm_runtime_invoke_native_raw` to handle registered native function pointer argument when memory64 is enabled - memory64 classic-interpreter spec test in `test_wamr.sh` and in CI Currently, it supports memory64 memory wasm file that uses core spec (including bulk memory proposal) opcodes and threads opcodes. ps. https://github.com/bytecodealliance/wasm-micro-runtime/issues/3091 https://github.com/bytecodealliance/wasm-micro-runtime/pull/3240 https://github.com/bytecodealliance/wasm-micro-runtime/pull/3260	2024-04-02 15:22:07 +08:00
TianlongLiang	c3e33a96ea	Remove unused argument in wasm_runtime_lookup_function and refactor WASMModuleInstance (#3218 ) Remove the unused parameter `signature` from `wasm_runtime_lookup_function`. Refactor the layout of WASMModuleInstance structure: - move common data members `c_api_func_imports` and `cur_exec_env` from `WASMModuleInstanceExtraCommon` to `WASMModuleInstance` - In `WASMModuleInstance`, enlarge `reserved[3]` to `reserved[5]` in case that we need to add more fields in the future ps. https://github.com/bytecodealliance/wasm-micro-runtime/issues/2530 https://github.com/bytecodealliance/wasm-micro-runtime/issues/3202	2024-03-13 12:28:45 +08:00
Wenyong Huang	0ee5ffce85	Refactor APIs and data structures as preliminary work for Memory64 (#3209 ) # Change the data type representing linear memory address from u32 to u64 ## APIs signature changes - (Export)wasm_runtime_module_malloc - wasm_module_malloc - wasm_module_malloc_internal - aot_module_malloc - aot_module_malloc_internal - wasm_runtime_module_realloc - wasm_module_realloc - wasm_module_realloc_internal - aot_module_realloc - aot_module_realloc_internal - (Export)wasm_runtime_module_free - wasm_module_free - wasm_module_free_internal - aot_module_free - aot_module_free_internal - (Export)wasm_runtime_module_dup_data - wasm_module_dup_data - aot_module_dup_data - (Export)wasm_runtime_validate_app_addr - (Export)wasm_runtime_validate_app_str_addr - (Export)wasm_runtime_validate_native_addr - (Export)wasm_runtime_addr_app_to_native - (Export)wasm_runtime_addr_native_to_app - (Export)wasm_runtime_get_app_addr_range - aot_set_aux_stack - aot_get_aux_stack - wasm_set_aux_stack - wasm_get_aux_stack - aot_check_app_addr_and_convert, wasm_check_app_addr_and_convert and jit_check_app_addr_and_convert - wasm_exec_env_set_aux_stack - wasm_exec_env_get_aux_stack - wasm_cluster_create_thread - wasm_cluster_allocate_aux_stack - wasm_cluster_free_aux_stack ## Data structure changes - WASMModule and AOTModule - field aux_data_end, aux_heap_base and aux_stack_bottom - WASMExecEnv - field aux_stack_boundary and aux_stack_bottom - AOTCompData - field aux_data_end, aux_heap_base and aux_stack_bottom - WASMMemoryInstance(AOTMemoryInstance) - field memory_data_size and change __padding to is_memory64 - WASMModuleInstMemConsumption - field total_size and memories_size - WASMDebugExecutionMemory - field start_offset and current_pos - WASMCluster - field stack_tops ## Components that are affected by the APIs and data structure changes - libc-builtin - libc-emcc - libc-uvwasi - libc-wasi - Python and Go Language Embedding - Interpreter Debug engine - Multi-thread: lib-pthread, wasi-threads and thread manager	2024-03-12 11:38:50 +08:00
Wenyong Huang	3a0e86454e	fast-interp: Fix GC opcode ref.as_non_null (#3156 ) The issue was found in https://github.com/bytecodealliance/wasm-micro-runtime/issues/3151.	2024-02-17 11:54:49 +08:00
Wenyong Huang	16a4d71b34	Implement GC (Garbage Collection) feature for interpreter, AOT and LLVM-JIT (#3125 ) Implement the GC (Garbage Collection) feature for interpreter mode, AOT mode and LLVM-JIT mode, and support most features of the latest spec proposal, and also enable the stringref feature. Use `cmake -DWAMR_BUILD_GC=1/0` to enable/disable the feature, and `wamrc --enable-gc` to generate the AOT file with GC supported. And update the AOT file version from 2 to 3 since there are many AOT ABI breaks, including the changes of AOT file format, the changes of AOT module/memory instance layouts, the AOT runtime APIs for the AOT code to invoke and so on.	2024-02-06 20:47:11 +08:00
TianlongLiang	f359b51525	Fix threads opcodes' boundary check in classic-interp and fast-interp (#3136 ) Using `CHECK_BULK_MEMORY_OVERFLOW(addr + offset, n, maddr)` to do the boundary check may encounter integer overflow in `addr + offset`, change to use `CHECK_MEMORY_OVERFLOW(n)` instead, which converts `addr` and `offset` to uint64 first and then add them to avoid integer overflow.	2024-02-06 11:52:30 +08:00
YAMAMOTO Takashi	529fa9dd17	EH: Fix broken stack usage calculation (#3121 ) Fixes: https://github.com/bytecodealliance/wasm-micro-runtime/issues/3108	2024-02-03 12:21:15 +08:00
Wenyong Huang	2eb60060d8	Fix read and validation of misc/simd/atomic sub opcodes (#3115 ) The format of sub opcodes after misc, simd and atomic prefix is leb u32. The issue was found in #2921.	2024-02-02 12:03:58 +08:00
YAMAMOTO Takashi	10e87d2966	EH: Don't call word_copy with zero size (#3105 )	2024-01-31 21:54:19 +08:00
Wenyong Huang	af318bac81	Implement Exception Handling for classic interpreter (#3096 ) This PR adds the initial support for WASM exception handling: * Inside the classic interpreter only: * Initial handling of Tags * Initial handling of Exceptions based on W3C Exception Proposal * Import and Export of Exceptions and Tags * Add `cmake -DWAMR_BUILD_EXCE_HANDLING=1/0` option to enable/disable the feature, and by default it is disabled * Update the wamr-test-suites scripts to test the feature * Additional CI/CD changes to validate the exception spec proposal cases Refer to: https://github.com/bytecodealliance/wasm-micro-runtime/issues/1884 `587513f3c6` `8bebfe9ad7` `59bccdfed8` Signed-off-by: Ricardo Aguilar <ricardoaguilar@siemens.com> Co-authored-by: Chris Woods <chris.woods@siemens.com> Co-authored-by: Rene Ermler <rene.ermler@siemens.com> Co-authored-by: Trenner Thomas <trenner.thomas@siemens.com>	2024-01-31 08:27:17 +08:00
liang.he	99bbad8cdb	perf profiling: Adjust the calculation of execution time (#3089 )	2024-01-26 18:06:21 +08:00
Wenyong Huang	313ce8cb61	Fix memory/table segment checks in memory.init/table.init (#3081 ) According to the wasm core spec, the checks for the table segments in `table.init` opcode are similar to the checks for `memory.init` opcode: - The size of a passive segment is shrunk to zero after `data.drop` (or `elem.drop`) opcode is executed, and the segment can be used to do `memory.init` (or `table.init`) again - The `memory.init` only traps when `s+n > len(data.data)` or `d+n > len(mem.data)` and `table.init` only traps when `s+n > len(elem.elem)` or `d+n > len(tab.elem)` - The active segment can also be used to do `memory.init` (or `table.init`), while it behaves like a dropped passive segment https://github.com/WebAssembly/bulk-memory-operations/blob/master/proposals/bulk-memory-operations/Overview.md ``` Segments can also be shrunk to size zero by using the following new instructions: - data.drop: discard the data in an data segment - elem.drop: discard the data in an element segment An active segment is equivalent to a passive segment, but with an implicit memory.init followed by a data.drop (or table.init followed by a elem.drop) that is prepended to the module's start function. ``` ps. https://webassembly.github.io/spec/core/bikeshed/#-hrefsyntax-instr-memorymathsfmemoryinitx%E2%91%A0 https://webassembly.github.io/spec/core/bikeshed/#-hrefsyntax-instr-tablemathsftableinitxy%E2%91%A0 https://github.com/bytecodealliance/wasm-micro-runtime/issues/3020	2024-01-26 09:45:59 +08:00
Wenyong Huang	a7545df5d0	classic-interp: Handle SIMD opcode when JIT is enabled (#3046 ) Though SIMD isn't supported by interpreter, when JIT is enabled, developer may run `iwasm --interp <wasm_file>` to trigger the SIMD opcode in interpreter, which isn't handled before this PR.	2024-01-19 12:31:18 +08:00
Wenyong Huang	9bcf6b4dd3	Enable quick aot entry when hw bound check is disabled (#3044 ) - Enable quick aot entry when hw bound check is disabled - Remove unnecessary ret_type argument in the quick aot entries - Declare detailed prototype of aot function to call in each quick aot entry	2024-01-19 08:55:35 +08:00
liang.he	5c8b8a17a6	Enhancements on wasm function execution time statistic (#2985 ) Enhance the statistic of wasm function execution time, or the performance profiling feature: - Add os_time_thread_cputime_us() to get the cputime of a thread, and use it to calculate the execution time of a wasm function - Support the statistic of the children execution time of a function, and dump it in wasm_runtime_dump_perf_profiling - Expose two APIs: wasm_runtime_sum_wasm_exec_time wasm_runtime_get_wasm_func_exec_time And rename os_time_get_boot_microsecond to os_time_get_boot_us.	2024-01-17 09:51:54 +08:00
Wenyong Huang	7c7684819d	Register quick call entries to speedup the aot/jit func call process (#2978 ) In some scenarios there may be lots of callings to AOT/JIT functions from the host embedder, which expects good performance for the calling process, while in the current implementation, runtime calls the wasm_runtime_invoke_native to prepare the array of registers and stacks for the invokeNative assemble code, and the latter then puts the elements in the array to physical registers and native stacks and calls the AOT/JIT function, there may be many data copying and handlings which impact the performance. This PR registers some quick AOT/JIT entries for some simple wasm signatures, and let runtime call the entry to directly invoke the AOT/JIT function instead of calling wasm_runtime_invoke_native, which speedups the calling process. We may extend the mechanism next to allow the developer to register his quick AOT/JIT entries to speedup the calling process of invoking the AOT/JIT functions for some specific signatures.	2024-01-10 16:44:09 +08:00
Wenyong Huang	3637f2df79	Refine LLVM JIT function call process (#2925 ) - Don't allocate the implicit/unused frame when calling the LLVM JIT function - Don't set exec_env's thread handle and stack boundary in the recursive calling from host, since they have been set in the first time calling - Fix frame not freed in llvm_jit_call_func_bytecode	2024-01-02 18:46:02 +08:00
YAMAMOTO Takashi	18529253d8	interpreter: Simplify memory.grow a bit (#2899 )	2023-12-12 20:24:51 +08:00
Yage Hu	ef0cd22119	Fix memory size not updating after growing in interpreter (#2898 ) This commit fixes linear memory size not updating after growing. This causes `memory.fill` to throw an exception after `memory.grow`.	2023-12-12 08:36:59 +08:00
Enrico Loparco	0455071fc1	Access linear memory size atomically (#2834 ) Fixes: https://github.com/bytecodealliance/wasm-micro-runtime/issues/2804	2023-11-29 20:27:17 +08:00
Huang Qi	0b29904f26	Fix configurable bounds checks typo (#2809 )	2023-11-21 17:32:45 +08:00
YAMAMOTO Takashi	562a5dd1b6	Fix data/elem drop (#2747 ) Currently, `data.drop` instruction is implemented by directly modifying the underlying module. It breaks use cases where you have multiple instances sharing a single loaded module. `elem.drop` has the same problem too. This PR fixes the issue by keeping track of which data/elem segments have been dropped by using bitmaps for each module instances separately, and add a sample to demonstrate the issue and make the CI run it. Also add a missing check of dropped elements to the fast-jit `table.init`. Fixes: https://github.com/bytecodealliance/wasm-micro-runtime/issues/2735 Fixes: https://github.com/bytecodealliance/wasm-micro-runtime/issues/2772	2023-11-18 08:50:16 +08:00
YAMAMOTO Takashi	24c4d256b3	Grab `cluster->lock` when modifying `exec_env->module_inst` (#2685 ) Fixes: https://github.com/bytecodealliance/wasm-micro-runtime/issues/2680 And when switching back to the original module_inst, propagate exception if any. cf. https://github.com/bytecodealliance/wasm-micro-runtime/issues/2512	2023-11-09 18:56:02 +08:00
Wenyong Huang	4f5ad4dc12	Apply no_sanitize_address for clang compiler in several places (#2663 ) Apply `no_sanitize_address` for clang compiler in several places in which it has been applied to gcc compiler. And refine the comment.	2023-10-25 08:05:26 +08:00
funera1	64baf54d88	Fix label index out-of-range references in op_br_table_cache (#2615 ) Fixed a bug in the processing of the br_table_cache opcode that caused out-of-range references when the label index was greater than the length of the label.	2023-10-03 10:33:00 +08:00
Enrico Loparco	00539620e9	Improve stack trace dump and fix coding guideline CI (#2599 ) Avoid the stack traces getting mixed up together when multi-threading is enabled by using exception_lock/unlock in dumping the call stacks. And remove duplicated call stack dump in wasm_application.c. Also update coding guideline CI to fix the clang-format-12 not found issue.	2023-09-29 10:52:54 +08:00
YAMAMOTO Takashi	51714c41c0	Introduce WASMModuleInstanceExtraCommon (#2429 ) Move the common parts of WASMModuleInstanceExtra and AOTModuleInstanceExtra into the new structure.	2023-08-08 09:35:29 +08:00
YAMAMOTO Takashi	91592429f4	Fix memory sharing (#2415 ) - Inherit shared memory from the parent instance, instead of trying to look it up by the underlying module. The old method works correctly only when every cluster uses different module. - Use reference count in WASMMemoryInstance/AOTMemoryInstance to mark whether the memory is shared or not - Retire WASMSharedMemNode - For atomic opcode implementations in the interpreters, use a global lock for now - Update the internal API users (wasi-threads, lib-pthread, wasm_runtime_spawn_thread) Fixes https://github.com/bytecodealliance/wasm-micro-runtime/issues/1962	2023-08-04 10:18:13 +08:00
Wenyong Huang	59b2099b68	Fix some check issues on table operations (#2392 ) Fix some check issues on table.init, table.fill and table.copy, and unify the check method for all running modes. Fix issue #2390 and #2096.	2023-07-27 21:53:48 +08:00
Marcin Kolny	0f4edf9735	Implement suspend flags as atomic variable (#2361 ) We have observed a significant performance degradation after merging https://github.com/bytecodealliance/wasm-micro-runtime/pull/1991 Instead of protecting suspend flags with a mutex, we implement the flags as atomic variable and only use mutex when atomics are not available on a given platform.	2023-07-21 08:27:09 +08:00
YAMAMOTO Takashi	228a3bed53	Fix unused warnings on disable_bounds_checks (#2347 )	2023-07-06 15:31:22 +08:00
Huang Qi	18092f86cc	Make memory access boundary check behavior configurable (#2289 ) Allow to use `cmake -DWAMR_CONFIGURABLE_BOUNDS_CHECKS=1` to build iwasm, and then run `iwasm --disable-bounds-checks` to disable the memory access boundary checks. And add two APIs: `wasm_runtime_set_bounds_checks` and `wasm_runtime_is_bounds_checks_enabled`	2023-07-04 16:21:30 +08:00
Wenyong Huang	76be848ec3	Implement the segue optimization for LLVM AOT/JIT (#2230 ) Segue is an optimization technology which uses x86 segment register to store the WebAssembly linear memory base address, so as to remove most of the cost of SFI (Software-based Fault Isolation) base addition and free up a general purpose register, by this way it may: - Improve the performance of JIT/AOT - Reduce the footprint of JIT/AOT, the JIT/AOT code generated is smaller - Reduce the compilation time of JIT/AOT This PR uses the x86-64 GS segment register to apply the optimization, currently it supports linux and linux-sgx platforms on x86-64 target. By default it is disabled, developer can use the option below to enable it for wamrc and iwasm(with LLVM JIT enabled): ```bash wamrc --enable-segue=[<flags>] -o output_file wasm_file iwasm --enable-segue=[<flags>] wasm_file [args...] ``` `flags` can be: i32.load, i64.load, f32.load, f64.load, v128.load, i32.store, i64.store, f32.store, f64.store, v128.store Use comma to separate them, e.g. `--enable-segue=i32.load,i64.store`, and `--enable-segue` means all flags are added. Acknowledgement: Many thanks to Intel Labs, UC San Diego and UT Austin teams for introducing this technology and the great support and guidance! Signed-off-by: Wenyong Huang <wenyong.huang@intel.com> Co-authored-by: Vahldiek-oberwagner, Anjo Lucas <anjo.lucas.vahldiek-oberwagner@intel.com>	2023-05-26 10:13:33 +08:00
Zzzabiyaka	27239723a9	Add asan and ubsan to WAMR CI (#2161 ) Add nightly (UTC time) checks with asan and ubsan, and also put gcc-4.8 build to nightly run since we don't need to run it with every PR. Co-authored-by: Maksim Litskevich <makslit@amazon.co.uk>	2023-05-26 09:45:37 +08:00
Wenyong Huang	1e5f206464	Fix compile warnings on windows platform (#2208 )	2023-05-15 13:48:48 +08:00
Wenyong Huang	5fc48e3584	Fix interpreter read linear memory size for multi-threading (#2088 ) Load memory data size in each time memory access boundary check in multi-threading mode since it may be changed by other threads when memory growing. And use `memory->memory_data_size` instead of `memory->num_bytes_per_page * memory->cur_page_count` to refine the code.	2023-04-04 09:05:52 +08:00
Wenyong Huang	49d439a3bc	Fix/Simplify the atomic.wait/notify implementations (#2044 ) Use the shared memory's shared_mem_lock to lock the whole atomic.wait and atomic.notify processes, and use it for os_cond_reltimedwait and os_cond_notify, so as to make the whole processes actual atomic operations: the original implementation accesses the wait address with shared_mem_lock and uses wait_node->wait_lock for os_cond_reltimedwait, which is not an atomic operation. And remove the unnecessary wait_map_lock and wait_lock, since the whole processes are already locked by shared_mem_lock.	2023-03-23 09:21:16 +08:00
Xu Jun	d75cb3224f	Fix dead lock in source debugger (#2040 )	2023-03-20 08:17:22 +08:00
Wenyong Huang	f279ba84ee	Fix multi-threading issues (#2013 ) - Implement atomic.fence to ensure a proper memory synchronization order - Destroy exec_env_singleton first in wasm/aot deinstantiation - Change terminate other threads to wait for other threads in wasm_exec_env_destroy - Fix detach thread in thread_manager_start_routine - Fix duplicated lock cluster->lock in wasm_cluster_cancel_thread - Add lib-pthread and lib-wasi-threads compilation to Windows CI	2023-03-08 10:57:22 +08:00
Enrico Loparco	e8d718096d	Add/reorganize locks for thread synchronization (#1995 ) Attempt to fix data races when using threads. - Protect access (from multiple threads) to exception and memory - Fix shared memory lock usage	2023-03-04 08:15:26 +08:00
Enrico Loparco	52e26e59cf	Add lock to protect the operations of accessing exec env (#1991 ) Data race may occur when accessing exec_env's fields, e.g. suspend_flags and handle. Add lock `exec_env->wait_lock` for them to resolve the issue.	2023-02-27 19:53:41 +08:00
Wenyong Huang	38c67b3f48	thread-mgr: Fix spread "wasi proc exit" exception and atomic.wait issues (#1988 ) Raising "wasi proc exit" exception, spreading it to other threads and then clearing it in all threads may result in unexpected behavior: the sub thread may end first, handle the "wasi proc exit" exception and clear exceptions of other threads, including the main thread. And when main thread's exception is cleared, it may continue to run and throw "unreachable" exception. This also leads to some assertion failed. Ignore exception spreading for "wasi proc exit" and don't clear exception of other threads to resolve the issue. And add suspend flag check after atomic wait since the atomic wait may be notified by other thread when exception occurs.	2023-02-24 20:05:39 +08:00
Wenyong Huang	ef3a683392	Don't call start/initialize in child thread's instantiation (#1967 ) The start/initialize functions of wasi module are to do some initialization work during instantiation, which should be only called one time in the instantiation of main instance. For example, they may initialize the data in linear memory, if the data is changed later by the main instance, and re-initialized again by the child instance, unexpected behaviors may occur. And clear a shadow warning in classic interpreter.	2023-02-17 15:11:05 +08:00
Enrico Loparco	216dc43ab4	Use shared memory lock for threads generated from same module (#1960 ) Multiple threads generated from the same module should use the same lock to protect the atomic operations. Before this PR, each thread used a different lock to protect atomic operations (e.g. atomic add), making the lock ineffective. Fix #1958.	2023-02-16 11:54:19 +08:00
Wenyong Huang	40a14b51c5	Enable running mode control for runtime and module instance (#1923 ) Enable setting running mode when executing a wasm bytecode file - Four running modes are supported: interpreter, fast-jit, llvm-jit and multi-tier-jit - Add APIs to set/get the default running mode of the runtime - Add APIs to set/get the running mode of a wasm module instance - Add running mode options for iwasm command line tool And add size/opt level options for LLVM JIT	2023-02-02 18:16:01 +08:00
YAMAMOTO Takashi	7d3b2a8773	Make memory profiling show native stack usage (#1917 )	2023-02-01 11:52:15 +08:00
Xu Jun	cadf9d0ad3	Main thread spread exception when thread-mgr is enabled (#1889 ) And refactor clear_wasi_proc_exit_exception, refer to https://github.com/bytecodealliance/wasm-micro-runtime/pull/1869	2023-01-20 08:54:27 +08:00
Xu Jun	e696ac36d7	Fix potential block issue in source debugger (#1887 ) Fix issue reported in #1860	2023-01-17 08:45:29 +08:00
Martin Klang	622cdbefd6	Prevent undefined behavior from c_api_func_imports == NULL (#1883 ) The module instance's c_api_func_imports may be NULL under some circumstances, add checks before accessing it.	2023-01-14 07:52:39 +08:00
Wenyong Huang	14288f59b0	Implement Multi-tier JIT (#1774 ) Implement 2-level Multi-tier JIT engine: tier-up from Fast JIT to LLVM JIT to get quick cold startup by Fast JIT and better performance by gradually switching to LLVM JIT when the LLVM JIT functions are compiled by the backend threads. Refer to: https://github.com/bytecodealliance/wasm-micro-runtime/issues/1302	2022-12-19 11:24:46 +08:00
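
Commit f359b51525 (#3136) above describes a boundary check in which `addr + offset` can overflow when computed in 32 bits, and fixes it by converting both operands to uint64 before adding. The following is a minimal sketch of that idea only. The function names and signatures are hypothetical and are not WAMR's `CHECK_MEMORY_OVERFLOW` macro or any other WAMR API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper (not a WAMR macro): true when the n-byte access
   starting at addr + offset fits inside a linear memory of mem_size bytes. */
static bool
access_in_bounds(uint32_t addr, uint32_t offset, uint32_t n,
                 uint64_t mem_size)
{
    /* Widen before adding so the sum cannot wrap around in 32 bits. */
    uint64_t start = (uint64_t)addr + (uint64_t)offset;
    return start + n <= mem_size;
}

/* The problematic form, for comparison: the sum is computed in 32 bits,
   so a large addr + offset wraps to a small value and passes the check. */
static bool
access_in_bounds_wrapping(uint32_t addr, uint32_t offset, uint32_t n,
                          uint64_t mem_size)
{
    uint32_t start = addr + offset; /* may wrap */
    return (uint64_t)start + n <= mem_size;
}

/* Illustrates the difference on a one-page (64 KiB) memory. */
static void
self_check(void)
{
    const uint64_t one_page = 65536;

    /* 0xFFFFFFF0 + 0x20 wraps to 0x10 in 32 bits. */
    assert(access_in_bounds_wrapping(0xFFFFFFF0u, 0x20u, 4, one_page));
    assert(!access_in_bounds(0xFFFFFFF0u, 0x20u, 4, one_page));

    /* An ordinary in-range access passes both checks. */
    assert(access_in_bounds(16, 16, 4, one_page));
    assert(access_in_bounds_wrapping(16, 16, 4, one_page));
}
```

The widened sum is at most about 2^33, so the 64-bit comparison itself cannot overflow.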


118 Commits
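
Commit 562a5dd1b6 (#2747) in the log above says that `data.drop` and `elem.drop` are tracked with a bitmap kept separately for each module instance, so that instances sharing one loaded module do not affect each other. A rough sketch of that bookkeeping follows. The struct and function names are assumptions made for illustration and do not reflect WAMR's actual data structures.

```c
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical per-instance state; names are illustrative, not WAMR's. */
typedef struct {
    uint32_t seg_count; /* number of data (or elem) segments in the module */
    uint8_t *dropped;   /* one bit per segment, owned by this instance */
} SegDropState;

static bool
seg_state_init(SegDropState *s, uint32_t seg_count)
{
    s->seg_count = seg_count;
    /* seg_count / 8 + 1 bytes always covers seg_count bits and is never 0. */
    s->dropped = calloc((size_t)seg_count / 8 + 1, 1);
    return s->dropped != NULL;
}

/* Mark a segment as dropped for this instance only. */
static void
seg_state_drop(SegDropState *s, uint32_t idx)
{
    if (idx < s->seg_count)
        s->dropped[idx / 8] |= (uint8_t)(1u << (idx % 8));
}

static bool
seg_state_is_dropped(const SegDropState *s, uint32_t idx)
{
    return idx < s->seg_count
           && (s->dropped[idx / 8] & (1u << (idx % 8))) != 0;
}

static void
seg_state_destroy(SegDropState *s)
{
    free(s->dropped);
    s->dropped = NULL;
}
```

With two `SegDropState` values created for the same segment count, dropping a segment through one leaves the other untouched, which is the property the commit message describes for instances that share a module.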