daquexian 9395a01939 Add run_net_x86 configuration 4 years ago
all_v7 Add guard in unsupported functions 4 years ago
arm64-v8a Add comment for .dap_pm configurations 5 years ago
arm64-v8a-wo-test Add comment for .dap_pm configurations 5 years ago
bconv_test Add comment for .dap_pm configurations 5 years ago
bconv_test_x86 Update some configurations 5 years ago
benchmark Polish the benchmark result format and fix the wrong benchmark daq_pm 5 years ago
benchmark_macro Add comment for .dap_pm configurations 5 years ago
benchmark_v7 Add daq_pm benchmark_v7 configuration 4 years ago
bgemm_test Update some configurations 5 years ago
bgemm_test_v7 Fix configuration which doesn't enable neon 4 years ago
bgemm_test_x86 Replace __builtin_popcountl in armv7, since long is 32-bit here 5 years ago
bitpack_test Update some configurations 5 years ago
converter Add comment for .dap_pm configurations 5 years ago
converter_python onnx2bnn python package 5 years ago
fconv_test Add comment for .dap_pm configurations 5 years ago
gemm_test Add comment for .dap_pm configurations 5 years ago
im2col_test Add comment for .dap_pm configurations 5 years ago
jni Add comment for .dap_pm configurations 5 years ago
net_test [daq_pm] Build net_test for android-21 instead of android-28 4 years ago
net_test_debug Add three daq_pm configurations 5 years ago
net_test_v7 Fix configuration which doesn't enable neon 4 years ago
net_test_x86 1. Update CMakeLists.txt to support non-Android aarch64 devices 5 years ago
pad_test Add comment for .dap_pm configurations 5 years ago
run_net Add comment for .dap_pm configurations 5 years ago
run_net_debug Add three daq_pm configurations 5 years ago
run_net_new_bitpack Add three daq_pm configurations 5 years ago
run_net_x86 Add run_net_x86 configuration 4 years ago

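We introduce dabnn, a highly optimized inference framework for binary neural networks, with its convolutions rewritten in ARM assembly. Experiments show that dabnn delivers large speedups over existing inference frameworks for both binary and floating-point networks. On a single binary convolution, dabnn runs at 800% to 2400% of the speed of BMXNet (the only other inference framework for binary networks); on a binarized ResNet-18, it runs at about 700% of the speed of BMXNet.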
