wasm-micro-runtime

mirror of https://github.com/bytecodealliance/wasm-micro-runtime.git synced 2025-06-18 02:59:21 +00:00

Author	SHA1	Message	Date
Xu Jun	1977ad23ef	fast-interp: Fix dynamic offset error issue in else branch (#3058 ) Reported in https://github.com/bytecodealliance/wasm-micro-runtime/issues/3026.	2024-01-19 19:58:12 +08:00
Wenyong Huang	b21f17dd6d	Refine AOT/JIT code call wasm-c-api import process (#2982 ) Allow to invoke the quick call entry wasm_runtime_quick_invoke_c_api_import to call the wasm-c-api import functions to speedup the calling process, which reduces the data copying. Use `wamrc --invoke-c-api-import` to generate the optimized AOT code, and set `jit_options->quick_invoke_c_api_import` true in wasm_engine_new when LLVM JIT is enabled.	2024-01-10 18:37:02 +08:00
Wenyong Huang	7c7684819d	Register quick call entries to speedup the aot/jit func call process (#2978 ) In some scenarios there may be many calls to AOT/JIT functions from the host embedder, which expects good performance for the calling process. In the current implementation, the runtime calls wasm_runtime_invoke_native to prepare the array of registers and stacks for the invokeNative assembly code, and the latter then puts the elements in the array into physical registers and the native stack and calls the AOT/JIT function; this may involve a lot of data copying and handling, which impacts performance. This PR registers some quick AOT/JIT entries for some simple wasm signatures, and lets the runtime call the entry to directly invoke the AOT/JIT function instead of calling wasm_runtime_invoke_native, which speeds up the calling process. We may extend the mechanism next to allow developers to register their own quick AOT/JIT entries to speed up the calling process of invoking the AOT/JIT functions for some specific signatures.	2024-01-10 16:44:09 +08:00
Xu Jun	f96257bade	Fix fast-interp polymorphic stack processing (#2974 ) Fix issue #2951, #2952 and #2953.	2024-01-04 10:00:36 +08:00
Wenyong Huang	1ee4767d97	Fix ref.func function declared check in wasm loader (#2972 ) The forward-declare function reference in ref.func can be declared in table element segments, no matter whether the segment mode is passive, active or declarative. Reported in https://github.com/bytecodealliance/wasm-micro-runtime/issues/2944.	2024-01-03 11:43:03 +08:00
Xu Jun	d818672f62	Fix ref.is_null processing in fast-interp loader (#2971 )	2024-01-02 18:10:01 +08:00
liang.he	5c3ad0279a	Enable AOT linux perf support (#2930 ) And refactor the original perf support - use WAMR_BUILD_LINUX_PERF as the cmake compilation control - use WASM_ENABLE_LINUX_PERF as the compiler macro - use `wamrc --enable-linux-perf` to generate aot file which contains fp operations - use `iwasm --enable-linux-perf` to create perf map for `perf record`	2024-01-02 15:58:17 +08:00
Xu Jun	53c3fa27d4	Fix block with type issue in fast interp (#2866 ) Reported in https://github.com/bytecodealliance/wasm-micro-runtime/issues/2863.	2023-12-05 17:09:05 +08:00
Wenyong Huang	23c1343fb3	Fix wasm loader handle op_br_table and op_drop (#2864 ) - Fix op_br_table arity type check when the dest block is loop block - Fix op_drop issue when the stack is polymorphic and it is to drop an ANY type value in the stack	2023-12-05 16:59:13 +08:00
liang.he	8aa813f44a	Generate jitdump to support linux perf for LLVM JIT (#2788 )	2023-11-27 15:42:00 +08:00
Wenyong Huang	d6bba13e86	Fix fast-interp "pre-compiled label offset out of range" issue (#2659 ) When labels-as-values is enabled in a target which doesn't support unaligned address access, 16-bit offset is used to store the relative offset between two opcode labels. But it is a little small and the loader may report "pre-compiled label offset out of range" error. Emitting 32-bit data instead to resolve the issue: emit label address in 32-bit target and emit 32-bit relative offset in 64-bit target. See also: https://github.com/bytecodealliance/wasm-micro-runtime/issues/2635	2023-10-24 10:47:17 +08:00
Wenyong Huang	6382162711	Fix loader push_pop_frame_ref_offset (#2590 ) `wasm_loader_push_pop_frame_offset` may pop n operands by using `loader_ctx->stack_cell_num` to check whether the operand can be popped or not. While `loader_ctx->stack_cell_num` is updated in the later `wasm_loader_push_pop_frame_ref`, the check may fail if the stack is in polymorphic state and lead to `ctx->frame_offset` underflow. Fix issue #2577 and #2586.	2023-09-26 10:17:54 +08:00
Xu Jun	7baaed2fb8	Fix opcode overwrite issue in fast interp (#2476 )	2023-08-17 19:49:35 +08:00
Wenyong Huang	edea32b629	Fix result arity check on select_t opcode (#2406 ) Typed select must have exactly one result. Reported in issue #2402.	2023-07-31 18:20:11 +08:00
Wenyong Huang	76be848ec3	Implement the segue optimization for LLVM AOT/JIT (#2230 ) Segue is an optimization technology which uses x86 segment register to store the WebAssembly linear memory base address, so as to remove most of the cost of SFI (Software-based Fault Isolation) base addition and free up a general purpose register, by this way it may: - Improve the performance of JIT/AOT - Reduce the footprint of JIT/AOT, the JIT/AOT code generated is smaller - Reduce the compilation time of JIT/AOT This PR uses the x86-64 GS segment register to apply the optimization, currently it supports linux and linux-sgx platforms on x86-64 target. By default it is disabled, developer can use the option below to enable it for wamrc and iwasm(with LLVM JIT enabled): ```bash wamrc --enable-segue=[<flags>] -o output_file wasm_file iwasm --enable-segue=[<flags>] wasm_file [args...] ``` `flags` can be: i32.load, i64.load, f32.load, f64.load, v128.load, i32.store, i64.store, f32.store, f64.store, v128.store Use comma to separate them, e.g. `--enable-segue=i32.load,i64.store`, and `--enable-segue` means all flags are added. Acknowledgement: Many thanks to Intel Labs, UC San Diego and UT Austin teams for introducing this technology and the great support and guidance! Signed-off-by: Wenyong Huang <wenyong.huang@intel.com> Co-authored-by: Vahldiek-oberwagner, Anjo Lucas <anjo.lucas.vahldiek-oberwagner@intel.com>	2023-05-26 10:13:33 +08:00
Wenyong Huang	e1d0c27ef9	Fix ref.func forward-declared function check (#2099 ) When ref.func opcode refers to a function whose function index no smaller than current function, the destination func should be forward-declared: it is declared in the table element segments, or is declared in the export list.	2023-04-03 15:55:24 +08:00
Wenyong Huang	605c8b07dc	Fix issue of Multi-tier JIT (#2056 )	2023-03-25 11:15:05 +08:00
Enrico Loparco	216dc43ab4	Use shared memory lock for threads generated from same module (#1960 ) Multiple threads generated from the same module should use the same lock to protect the atomic operations. Before this PR, each thread used a different lock to protect atomic operations (e.g. atomic add), making the lock ineffective. Fix #1958.	2023-02-16 11:54:19 +08:00
Wenyong Huang	40a14b51c5	Enable running mode control for runtime and module instance (#1923 ) Enable setting running mode when executing a wasm bytecode file - Four running modes are supported: interpreter, fast-jit, llvm-jit and multi-tier-jit - Add APIs to set/get the default running mode of the runtime - Add APIs to set/get the running mode of a wasm module instance - Add running mode options for iwasm command line tool And add size/opt level options for LLVM JIT	2023-02-02 18:16:01 +08:00
YAMAMOTO Takashi	7d3b2a8773	Make memory profiling show native stack usage (#1917 )	2023-02-01 11:52:15 +08:00
Wenyong Huang	0090d3e3fc	Fix issue of resolving func name in custom name section (#1849 ) Should use import_function_count but not import_count to calculate the func_index in handle_name_section when custom name section feature is enabled. And clear the compile warnings of mini loader.	2022-12-30 14:37:04 +08:00
Wenyong Huang	14288f59b0	Implement Multi-tier JIT (#1774 ) Implement 2-level Multi-tier JIT engine: tier-up from Fast JIT to LLVM JIT to get quick cold startup by Fast JIT and better performance by gradually switching to LLVM JIT when the LLVM JIT functions are compiled by the backend threads. Refer to: https://github.com/bytecodealliance/wasm-micro-runtime/issues/1302	2022-12-19 11:24:46 +08:00
Wenyong Huang	1652f22a77	Fix issues reported by Coverity (#1775 ) Fix some issues reported by Coverity and fix windows exception check with guard page issue	2022-12-01 19:24:13 +08:00
Wenyong Huang	96570cca22	Remove unused LLVM JIT wrapper functions (#1747 ) Only create the necessary wrapper functions for LLVM JIT	2022-11-25 11:26:08 +08:00
Wenyong Huang	da7117a092	Refine the stack frame size check in interpreter (#1730 ) Limit max_stack_cell_num/max_csp_num to be no larger than UINT16_MAX, and don't check all_cell_num in interpreter again. And refine some codes in interpreter.	2022-11-22 15:32:48 +08:00
Wenyong Huang	c70e1ebc3d	Avoid generating some unused LLVM IRs (#1696 ) Refine the generated LLVM IRs at the beginning of each LLVM AOT/JIT function to fasten the LLVM IR optimization: - Only create argv_buf if there are func calls in this function - Only create native stack bound if stack bound check is enabled - Only create aux stack info if there is opcode set_global_aux_stack - Only create native symbol if indirect_mode is enabled - Only create memory info if there are memory operations - Only create func_type_indexes if there is opcode call_indirect	2022-11-14 14:32:35 +08:00
Wenyong Huang	e87a554616	Refactor LLVM JIT (#1613 ) Refactor LLVM JIT for some purposes: - To simplify the source code of JIT compilation - To simplify the JIT modes - To align with LLVM latest changes - To prepare for the Multi-tier JIT compilation, refer to #1302 The changes mainly include: - Remove the MCJIT mode, replace it with ORC JIT eager mode - Remove the LLVM legacy pass manager (only keep the LLVM new pass manager) - Change the lazy mode's LLVM module/function binding: change each function in an individual LLVM module into all functions in a single LLVM module - Upgraded ORC JIT to ORCv2 JIT to enable lazy compilation Refer to #1468	2022-10-18 20:17:34 +08:00
Wenyong Huang	a182926a73	Refactor interpreter/AOT module instance layout (#1559 ) Refactor the layout of interpreter and AOT module instance: - Unify the interp/AOT module instance, use the same WASMModuleInstance/ WASMMemoryInstance/WASMTableInstance data structures for both interpreter and AOT - Make the offset of most fields the same in module instance for both interpreter and AOT, append memory instance structure, global data and table instances to the end of module instance for interpreter mode (like AOT mode) - For extra fields in WASM module instance, use WASMModuleInstanceExtra to create a field `e` for interpreter - Change the LLVM JIT module instance creating process, LLVM JIT uses the WASM module and module instance same as interpreter/Fast-JIT mode. So that Fast JIT and LLVM JIT can access the same data structures, and make it possible to implement the Multi-tier JIT (tier-up from Fast JIT to LLVM JIT) in the future - Unify some APIs: merge some APIs for module instance and memory instance's related operations (only implement one copy) Note that the AOT ABI is same, the AOT file format, AOT relocation types, how AOT code accesses the AOT module instance and so on are kept unchanged. Refer to: https://github.com/bytecodealliance/wasm-micro-runtime/issues/1384	2022-10-18 10:59:28 +08:00
Wenyong Huang	64c0b15c52	loader: Sub local count can be 0 (#1504 ) Sub local count is allowed to be 0 in each group of function local types.	2022-09-20 12:40:24 +08:00
Wenyong Huang	ab929c20a3	Add check for code section size, fix interp float operations (#1480 ) And enable classic interpreter instead of fast interpreter when llvm jit is enabled, so as to fix the issue that llvm jit cannot handle opcode drop_64/select_64.	2022-09-14 19:49:18 +08:00
Wenyong Huang	8a7dd4dc3e	Remove handling unsupported opcodes in loader (#1464 ) Remove handling opcode DROP_64/SELECT_64 in loader stage prepare_bytecode, as they are the modified opcodes of DROP/SELECT for optimization purpose, but not the opcodes defined by spec.	2022-09-08 15:38:16 +08:00
Wenyong Huang	ccd627d2c6	Fix linear memory page count issues (#1380 ) Fix issue reported in #1289 and #1371. Enable to set the max page count to 65536.	2022-08-23 16:05:13 +08:00
FromLiQg	a382a02ea9	Fix wasm_type_equal check in wasm_mini_loader.c (#1394 ) Fix wasm_type_equal check error in wasm_mini_loader.c: wasm_type_equal(type, j) -> wasm_type_equal(type, module->types[j]) And remove unused comments in aot_runtime.h	2022-08-19 12:56:24 +08:00
FromLiQg	88bb4f3c81	Normalize wasm types (#1378 ) Normalize wasm types: if two wasm types have the same parameter types and result types, we only save one copy, so as to reduce the footprint and simplify the type comparison in opcode CALL_INDIRECT. And fix issue in interpreter globals_instantiate, and remove unused codes.	2022-08-18 17:52:02 +08:00
Xu Jun	3b641b17d8	Reserve one pointer size for fast-interp code_compiled_size (#1382 ) Reserve one pointer size for fast-interp code_compiled_size: if the last opcode of current function is to be dropped (e.g. OP_DROP), the peak memory usage will be larger than the final code_compiled_size, we record the peak size to ensure there won't be invalid memory access during the second traversing.	2022-08-15 11:33:20 +08:00
Xu Jun	872cc51881	Fix mini-loader issue (#1383 )	2022-08-12 16:35:57 +08:00
Wenyong Huang	1fff8d5cbc	Fix wasm loader issues (#1363 ) Should not clear last label's polymorphic state after current label is popped Fix invalid func_idx check in opcode REF_FUNC Add check when there are extra unneeded bytecodes for a wasm function	2022-08-08 13:22:23 +08:00
Wenyong Huang	bf28030993	Import WAMR Fast JIT (#1343 ) Import WAMR Fast JIT which is a lightweight JIT with quick startup, small footprint, relatively good performance (~40% to ~50% of LLVM JIT) and good portability. Platforms supported: Linux, MacOS and Linux SGX. Arch supported: x86-64.	2022-08-02 16:03:50 +08:00
yaozhongxiao	efc8bc10a9	[bugfix] initialize "module->retain_function" for wasm_mini_loader (#1333 ) Before resolving the module function's export in wasm_mini_loader, "module->retain_function" need to be initialized, otherwise, the "__new" function export will lead to abort. issue: https://github.com/bytecodealliance/wasm-micro-runtime/issues/1332 Co-authored-by: yaozhongxiao <yaozhongxiao@bytedance.com>	2022-07-27 18:01:20 +08:00
Xu Jun	188d5e70e9	Fix typo in wasm_mini_loader.c (#1232 )	2022-06-16 12:07:32 +08:00
Xu Jun	b39f4c5c9b	Fix drop opcode issue in fast interpreter (#1231 ) Fix fast interpreter issue reported in #1230	2022-06-16 09:51:01 +08:00
Wenyong Huang	d62543c99c	Enlarge max pool size and fix bh_memcpy_s dest max size check (#1151 ) Enlarge max pool size and fix bh_memcpy_s dest max size check to support large linear memory, e.g. with initial page count 65535.	2022-05-07 16:09:16 +08:00
Wenyong Huang	adaaf348ed	Refine opcode br_table for classic interpreter (#1112 ) Refine opcode br_table for classic interpreter as there may be a lot of leb128 decoding when the br count is big: 1. Use the bytecode itself to store the decoded leb br depths if each decoded depth can be stored with one byte 2. Create br_table cache to store the decoded leb br depths if the decoded depth cannot be stored with one byte After the optimization, the classic interpreter can access the br depths array with index, no need to decode the leb128 again. And fix function record_fast_op() return value unchecked issue in source debugging feature.	2022-04-23 19:15:55 +08:00
Wenyong Huang	d6e781af28	Add more operand stack overflow checks for fast-interp (#1104 ) And clear some compile warnings on Windows	2022-04-20 16:19:12 +08:00
Xu Jun	fd9cce0eef	Add fast interpreter offset overflow check (#1076 ) * check fast interpreter offset overflow	2022-04-07 21:07:32 +08:00
Xu Jun	f0dc6a3015	Fix fast interpreter constant space overflow issue (#1071 ) Fix the potential integer overflow of const index in const space of fast interpreter, emit i32/i64.const opcode when the const index is larger than INT32_MAX. And add check for the function local cell num.	2022-04-04 07:55:37 +08:00
Wenyong Huang	7262aebf77	Fix issues found by GC and Fast JIT, refine some codes (#1055 ) Fix handle OP_TABLE_COPY issue Fix loader handle OP_BLOCK/IF/LOOP issue if type_index is larger than 256 Fix loader handle OP_GET_GLOBAL, allow to change to GET_GLOBAL_64 for aot compiler similar to handling OP_SET_GLOBAL Refine loader handle OP_GET/SET/TEE_LOCAL, disable changing opcode when source debugging is enabled, so as no need to record the change of opcode Refine wasm_interp_interp_frame_size to reduce the wasm operand stack usage Signed-off-by: Wenyong Huang <wenyong.huang@intel.com>	2022-03-24 14:14:42 +08:00
Wenyong Huang	b6e5206e61	Fix wasm_runtime_load argument type invalid issue (#1059 ) Remove the `const` flag for the first argument `buf` of wasm_runtime_load as it might be modified by runtime for footprint and performance purpose, and update the related functions and document.	2022-03-24 10:08:49 +08:00
liang.he	50b6474f54	Add WASI ABI compatibility check for multi-module (#913 ) Refer to https://github.com/WebAssembly/WASI/blob/main/design/application-abi.md to check the WASI ABI compatibility: - Command (main module) may export _start function with signature "()" - Reactor (sub module) may export _initialize function with signature "()" - _start and _initialize can not be exported at the same time - Reactor cannot export _start function - Command and Reactor must export memory And - Rename module->is_wasi_module to module->import_wasi_api - Refactor wasm_loader_find_export() - Remove MULTI_MODULE related codes from mini_loader - Update multi-module samples - Fix a "use-after-free" issue. Since we reuse the memory instance of sub module, just to protect it from freeing an imported memory instance	2021-12-29 11:04:36 +08:00
Wenyong Huang	5547924e28	Refine codes and fix several issues (#882 ) Refine some codes in wasm loader Add -Wshadow to gcc compile flags and fix some variable shadowed issues Fix function parameter/return types not checked issue Fix fast-interp loader reserve_block_ret() not handle V128 return type issue Fix mini loader load_table_segment_section() failed issue Add detailed comments for argc argument in wasm_runtime_call_wasm()	2021-12-10 18:13:17 +08:00
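
Commit adaaf348ed above describes pre-decoding the LEB128 branch depths of `br_table` so the classic interpreter can index them directly. The following is a minimal sketch of that idea, not the code from the commit: the names `read_leb_u32`, `predecode_br_depths` and `BrTableMode` are hypothetical, the caller is assumed to supply a `cache` array of `count` entries up front, and the LEB128 reader skips the strict check on the unused high bits of a fifth byte.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical result codes for this sketch. */
typedef enum { BR_ERROR, BR_IN_PLACE, BR_CACHED } BrTableMode;

/* Decode one unsigned LEB128 value (at most 5 bytes) starting at *p.
   Advances *p past the value; returns false on truncated or overlong input. */
static bool
read_leb_u32(const uint8_t **p, const uint8_t *end, uint32_t *out)
{
    uint32_t result = 0;
    unsigned shift = 0;

    while (*p < end && shift < 35) {
        uint8_t byte = *(*p)++;
        result |= (uint32_t)(byte & 0x7f) << shift;
        if ((byte & 0x80) == 0) {
            *out = result;
            return true;
        }
        shift += 7;
    }
    return false;
}

/* Decode `count` branch depths from buf into cache.
   If every depth fits in one byte, also rewrite them in place so that
   depth i can be read as buf[i]. This is safe because each LEB128 value
   occupies at least one byte, so the rewritten array never outgrows the
   encoded one. Otherwise the depths stay available in cache only. */
static BrTableMode
predecode_br_depths(uint8_t *buf, size_t buf_len, uint32_t count,
                    uint32_t *cache)
{
    const uint8_t *p = buf;
    const uint8_t *end = buf + buf_len;
    bool all_one_byte = true;
    uint32_t i;

    for (i = 0; i < count; i++) {
        if (!read_leb_u32(&p, end, &cache[i]))
            return BR_ERROR;
        if (cache[i] > UINT8_MAX)
            all_one_byte = false;
    }

    if (!all_one_byte)
        return BR_CACHED;

    for (i = 0; i < count; i++)
        buf[i] = (uint8_t)cache[i];
    return BR_IN_PLACE;
}
```

Note that a depth such as 200 takes two bytes in LEB128 (`0xC8 0x01`) but still fits in one raw byte, so it qualifies for the in-place path. After either path, the depth lookup is a plain array index with no decoding.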

89 Commits
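
Commit 88bb4f3c81 (Normalize wasm types) keeps a single copy of structurally identical function types so that the `CALL_INDIRECT` type check can compare identities rather than whole signatures. A rough sketch of the idea follows; the `FuncType` layout, the `MAX_ARITY` limit and the function names are assumptions made for illustration and do not reflect the WAMR data structures.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical fixed-size signature: params followed by results.
   Assumes param_count + result_count <= MAX_ARITY. */
#define MAX_ARITY 8

typedef struct FuncType {
    uint16_t param_count;
    uint16_t result_count;
    uint8_t types[MAX_ARITY]; /* value type codes, e.g. 0x7F for i32 */
} FuncType;

/* Structural equality: same counts and same value type sequence. */
static int
func_type_equal(const FuncType *a, const FuncType *b)
{
    return a->param_count == b->param_count
           && a->result_count == b->result_count
           && memcmp(a->types, b->types,
                     (size_t)a->param_count + a->result_count) == 0;
}

/* For each type index i, store in canon[i] the smallest index whose type
   is structurally equal to types[i]. Two indices then denote the same
   signature exactly when their canon entries are equal. */
static void
normalize_types(const FuncType *types, uint32_t count, uint32_t *canon)
{
    uint32_t i, j;

    for (i = 0; i < count; i++) {
        canon[i] = i;
        for (j = 0; j < i; j++) {
            if (func_type_equal(&types[i], &types[j])) {
                canon[i] = j;
                break;
            }
        }
    }
}
```

With the canonical indices computed once at load time, an indirect call can check `canon[expected] == canon[actual]` instead of comparing parameter and result lists on every call. The quadratic scan is acceptable for a sketch; a loader with many types might prefer hashing.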