discuss-gnuradio
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Discuss-gnuradio] Two questions re VOLK armv7


From: Dennis Glatting
Subject: Re: [Discuss-gnuradio] Two questions re VOLK armv7
Date: Sat, 04 Jul 2015 18:07:51 -0700

On Sat, 2015-07-04 at 16:32 -0400, West, Nathan wrote:
> Hi Dennis,
> 
> 
> The 32fc_s32fc_multiply_32fc should be fixed as of about 12 hours ago.
> Try updating your repo.
> 

Thanks. I did a pull and recompiled. Works better. The test now says:

...
Kernel QA finished: 2 failures out of 91 tests.
The following kernels failed QA:
    volk_32fc_magnitude_32f
    volk_32f_sqrt_32f


This is my ToolChain file, BTW: 

address@hidden:/mnt/volk# cat cmake/Toolchains/armv7.cmake 
########################################################################
# Toolchain file for building native on a ARM Cortex A8 w/ NEON
# Usage: cmake -DCMAKE_TOOLCHAIN_FILE=<this file> <source directory>
########################################################################
#
# Dennis Glatting, July 2015

SET(CMAKE_CXX_FLAGS_INIT "-march=armv7 -mfpu=neon -mfloat-abi=hard")
SET(CMAKE_C_FLAGS_INIT   "-march=armv7 -mfpu=neon -mfloat-abi=hard")


I am native compiling on a CubieBoard4:

address@hidden:/mnt/volk# uname -a
Linux cubieboard4 3.4.39 #18 SMP PREEMPT Tue Apr 21 15:01:22 CST 2015
armv7l GNU/Linux







> 
> The other two have nothing 'wrong' with them. We use a QA tolerance to
> check results and it just happens that the tolerance is a bit too low
> on ARM. Those operations are hard to get accurate and fast because
> they involve a square root. NEON happens to have an inverse square
> root estimate which we use because it's the fastest way to get a
> result. Unfortunately whenever we have to use the inverse square root
> estimate from NEON the accuracy drops a bit. In these cases since we
> actually care about the sqiare root we use the NEON inverse estimate
> which is also not particularly accurate. The two together make any
> square root-like operations in VOLK not particularly accurate in NEON.
> On the plus side it's probably accurate enough for radio.
> 
> 
> Nathan
> 
> 
> On Fri, Jul 3, 2015 at 11:32 PM, Dennis Glatting <address@hidden>
> wrote:
>         
>         1: To get VOLK to compile correctly under gnuradio, I need to
>         include
>         -march=armv7 in CFLAGS (volk/lib/CMakeLists.txt) which I do
>         with a tool
>         chain file with volk git'd outside gnuradio.
>         
>                 address@hidden:/mnt/volk# rm CMakeCache.txt; cmake
>         
>         -DCMAKE_USER_MAKE_RULES_OVERRIDE=cmake/Toolchains/armv7.cmake .
>         
>         What is the best way to ensure -march=armv7 gets passed to
>         cmake when
>         building gnuradio? Specify CFLAGS on the command line or the
>         environment?
>         
>         GCC says:
>         
>         address@hidden:/mnt/volk# gcc -v
>         Using built-in specs.
>         COLLECT_GCC=gcc
>         COLLECT_LTO_WRAPPER=/usr/lib/gcc/arm-linux-gnueabihf/4.9/lto-wrapper
>         Target: arm-linux-gnueabihf
>         Configured with: ../src/configure -v --with-pkgversion='Debian
>         4.9.2-10'
>         --with-bugurl=file:///usr/share/doc/gcc-4.9/README.Bugs
>         --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++
>         --prefix=/usr
>         --program-suffix=-4.9 --enable-shared --enable-linker-build-id
>         --libexecdir=/usr/lib --without-included-gettext
>         --enable-threads=posix
>         --with-gxx-include-dir=/usr/include/c++/4.9 --libdir=/usr/lib
>         --enable-nls --with-sysroot=/ --enable-clocale=gnu
>         --enable-libstdcxx-debug --enable-libstdcxx-time=yes
>         --enable-gnu-unique-object --disable-libitm
>         --disable-libquadmath
>         --enable-plugin --with-system-zlib --disable-browser-plugin
>         --enable-java-awt=gtk --enable-gtk-cairo
>         --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.9-armhf/jre
>         --enable-java-home
>         --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.9-armhf
>         --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.9-armhf
>         --with-arch-directory=arm
>         --with-ecj-jar=/usr/share/java/eclipse-ecj.jar
>         --enable-objc-gc --enable-multiarch --disable-sjlj-exceptions
>         --with-arch=armv7-a --with-fpu=vfpv3-d16 --with-float=hard
>         --with-mode=thumb --enable-checking=release
>         --build=arm-linux-gnueabihf
>         --host=arm-linux-gnueabihf --target=arm-linux-gnueabihf
>         Thread model: posix
>         gcc version 4.9.2 (Debian 4.9.2-10)
>         
>         
>         2: Three QA tests fail:
>         
>         The following kernels failed QA:
>         
>             volk_32fc_magnitude_32f
>             volk_32f_sqrt_32f
>             volk_32fc_s32fc_multiply_32fc
>         
>         How do I go about fixing that? I included test output below.
>         
>         
>         address@hidden:/mnt/volk# ./lib/test_all
>         Using Volk machine: neon_hardfp
>         RUN_VOLK_TESTS: volk_64u_popcntpuppet_64u(131071,1)
>         generic completed in 0.993ms
>         neon completed in 1.402ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16u_byteswappuppet_16u(131071,1)
>         generic completed in 0.478ms
>         neon completed in 0.266ms
>         neon_table completed in 0.271ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32u_byteswappuppet_32u(131071,1)
>         generic completed in 0.762ms
>         neon completed in 0.659ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32u_popcntpuppet_32u(131071,1)
>         generic completed in 0.661ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_64u_byteswappuppet_64u(131071,1)
>         generic completed in 1.621ms
>         neon completed in 1.136ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32fc_s32fc_rotatorpuppet_32fc(131071,1)
>         generic completed in 11.884ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_8u_conv_k7_r2puppet_8u(131071,0)
>         generic completed in 0.002ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_fm_detectpuppet_32f(131071,1)
>         generic completed in 2.371ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16ic_s32f_deinterleave_real_32f(131071,1)
>         generic completed in 0.741ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16ic_deinterleave_real_8i(131071,1)
>         generic completed in 0.509ms
>         neon completed in 0.404ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_16ic_deinterleave_16i_x2(131071,1)
>         generic completed in 0.56ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16ic_s32f_deinterleave_32f_x2(131071,1)
>         generic completed in 1.199ms
>         neon completed in 1.11ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_16ic_deinterleave_real_16i(131071,1)
>         generic completed in 0.34ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16ic_magnitude_16i(131071,1)
>         generic completed in 2.14ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16ic_s32f_magnitude_32f(131071,1)
>         generic completed in 1.413ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16i_s32f_convert_32f(131071,1)
>         generic completed in 0.32ms
>         neon completed in 0.267ms
>         a_generic completed in 0.25ms
>         Best aligned arch: a_generic
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_16i_convert_8i(131071,1)
>         generic completed in 0.218ms
>         neon completed in 0.23ms
>         a_generic completed in 0.21ms
>         Best aligned arch: a_generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_16i_32fc_dot_prod_32fc(131071,1)
>         generic completed in 0.916ms
>         neon completed in 0.773ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32f_accumulator_s32f(131071,1)
>         generic completed in 0.292ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_add_32f(131071,1)
>         generic completed in 0.652ms
>         u_neon completed in 0.607ms
>         a_generic completed in 0.601ms
>         Best aligned arch: a_generic
>         Best unaligned arch: u_neon
>         RUN_VOLK_TESTS: volk_32fc_32f_multiply_32fc(131071,1)
>         generic completed in 1.002ms
>         neon completed in 0.926ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32f_log2_32f(131071,1)
>         generic completed in 7.741ms
>         neon completed in 0.888ms
>         u_generic completed in 7.684ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32f_expfast_32f(131071,1)
>         generic completed in 27.81ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_pow_32f(131071,1)
>         generic completed in 33.018ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_sin_32f(131071,1)
>         generic completed in 4.454ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_cos_32f(131071,1)
>         generic completed in 5.466ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_tan_32f(131071,1)
>         generic completed in 8.962ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_atan_32f(131071,1)
>         generic completed in 7.27ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_asin_32f(131071,1)
>         u_generic completed in 12.883ms
>         Best aligned arch: u_generic
>         Best unaligned arch: u_generic
>         RUN_VOLK_TESTS: volk_32f_acos_32f(131071,1)
>         generic completed in 13.695ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_s32f_power_32fc(131071,1)
>         generic completed in 163.532ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS:
>         volk_32f_s32f_calc_spectral_noise_floor_32f(131071,1)
>         generic completed in 1.123ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_s32f_atan2_32f(131071,1)
>         generic completed in 14.414ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_x2_conjugate_dot_prod_32fc(131071,1)
>         generic completed in 1.273ms
>         neon completed in 1.033ms
>         a_generic completed in 1.175ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32fc_deinterleave_32f_x2(131071,1)
>         neon completed in 1.234ms
>         generic completed in 1.372ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32fc_deinterleave_64f_x2(131071,1)
>         generic completed in 1.932ms
>         a_generic completed in 2.05ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_s32f_deinterleave_real_16i(131071,1)
>         generic completed in 0.653ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_deinterleave_imag_32f(131071,1)
>         neon completed in 0.787ms
>         generic completed in 0.664ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_deinterleave_real_32f(131071,1)
>         generic completed in 0.809ms
>         neon completed in 0.669ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32fc_deinterleave_real_64f(131071,1)
>         generic completed in 0.918ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_x2_dot_prod_32fc(131071,1)
>         generic completed in 1.251ms
>         a_generic completed in 1.258ms
>         neon completed in 1.102ms
>         neon_opttests completed in 1.176ms
>         neon_optfma completed in 0.995ms
>         neon_optfmaunroll completed in 1.212ms
>         Best aligned arch: neon_optfma
>         Best unaligned arch: neon_optfma
>         RUN_VOLK_TESTS: volk_32fc_32f_dot_prod_32fc(131071,1)
>         generic completed in 0.934ms
>         neon_unroll completed in 0.862ms
>         a_neon completed in 0.868ms
>         a_neonasm completed in 0.837ms
>         a_neonpipeline completed in 0.687ms
>         Best aligned arch: a_neonpipeline
>         Best unaligned arch: neon_unroll
>         RUN_VOLK_TESTS: volk_32fc_index_max_16u(131071,1)
>         generic completed in 0.875ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_s32f_magnitude_16i(131071,1)
>         generic completed in 1.588ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_magnitude_32f(131071,1)
>         generic completed in 1.335ms
>         a_generic completed in 1.295ms
>         neon completed in 0.673ms
>         neon_fancy_sweet completed in 0.924ms
>         offset 0 in1: 0.705791 in2: 0.707031
>         offset 1 in1: 1.21055 in2: 1.20703
>         offset 2 in1: 0.453742 in2: 0.453125
>         offset 3 in1: 0.970181 in2: 0.96875
>         offset 4 in1: 1.05658 in2: 1.05469
>         offset 5 in1: 0.821474 in2: 0.818359
>         offset 6 in1: 0.862105 in2: 0.861328
>         offset 7 in1: 0.897716 in2: 0.896484
>         offset 8 in1: 0.796805 in2: 0.796875
>         offset 9 in1: 0.889502 in2: 0.886719
>         volk_32fc_magnitude_32f: fail on arch neon
>         offset 0 in1: 0.705791 in2: 0.710017
>         offset 1 in1: 1.21055 in2: 1.2123
>         offset 2 in1: 0.453742 in2: 0.451831
>         offset 3 in1: 0.970181 in2: 0.967088
>         offset 4 in1: 1.05658 in2: 1.04841
>         offset 5 in1: 0.821474 in2: 0.826919
>         offset 6 in1: 0.862105 in2: 0.868165
>         offset 7 in1: 0.897716 in2: 0.898518
>         offset 8 in1: 0.796805 in2: 0.804625
>         offset 9 in1: 0.889502 in2: 0.897857
>         volk_32fc_magnitude_32f: fail on arch neon_fancy_sweet
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         Failure on volk_32fc_magnitude_32f
>         RUN_VOLK_TESTS: volk_32fc_magnitude_squared_32f(131071,1)
>         generic completed in 0.672ms
>         neon completed in 0.641ms
>         a_generic completed in 0.669ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32fc_x2_multiply_32fc(131071,1)
>         generic completed in 1.774ms
>         a_generic completed in 1.738ms
>         neon completed in 1.218ms
>         neon_opttests completed in 1.189ms
>         neonasm completed in 1.173ms
>         Best aligned arch: neonasm
>         Best unaligned arch: neonasm
>         RUN_VOLK_TESTS: volk_32fc_x2_multiply_conjugate_32fc(131071,1)
>         generic completed in 1.82ms
>         neon completed in 1.253ms
>         a_generic completed in 1.794ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32fc_conjugate_32fc(131071,1)
>         generic completed in 0.736ms
>         a_neon completed in 0.749ms
>         a_generic completed in 0.653ms
>         Best aligned arch: a_generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_s32f_convert_16i(131071,1)
>         generic completed in 3.872ms
>         a_generic completed in 3.926ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_s32f_convert_32i(131071,1)
>         generic completed in 0.782ms
>         a_generic completed in 0.782ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_convert_64f(131071,1)
>         generic completed in 0.645ms
>         a_generic completed in 0.477ms
>         Best aligned arch: a_generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_s32f_convert_8i(131071,1)
>         generic completed in 2.2ms
>         a_generic completed in 2.111ms
>         Best aligned arch: a_generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_s32f_power_spectrum_32f(131071,1)
>         generic completed in 13.009ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_x2_square_dist_32f(131071,1)
>         neon completed in 0.738ms
>         generic completed in 0.686ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS:
>         volk_32fc_x2_s32f_square_dist_scalar_mult_32f(131071,1)
>         generic completed in 0.724ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_divide_32f(131071,1)
>         generic completed in 1.434ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_dot_prod_32f(131071,1)
>         generic completed in 0.592ms
>         a_generic completed in 0.575ms
>         neonopts completed in 0.553ms
>         neon completed in 0.574ms
>         neonasm completed in 0.433ms
>         neonasm_opts completed in 0.507ms
>         Best aligned arch: neonasm
>         Best unaligned arch: neonasm
>         RUN_VOLK_TESTS: volk_32f_x2_s32f_interleave_16ic(131071,1)
>         generic completed in 0.7ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_interleave_32fc(131071,1)
>         neon completed in 0.767ms
>         generic completed in 0.748ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_max_32f(131071,1)
>         neon completed in 0.631ms
>         generic completed in 0.596ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_min_32f(131071,1)
>         neon completed in 0.721ms
>         generic completed in 0.625ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_multiply_32f(131071,1)
>         generic completed in 0.737ms
>         neon completed in 0.625ms
>         a_generic completed in 0.611ms
>         Best aligned arch: a_generic
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32f_s32f_normalize(131071,1)
>         generic completed in 0.632ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_s32f_power_32f(131071,1)
>         generic completed in 35.731ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_sqrt_32f(131071,1)
>         neon completed in 0.534ms
>         generic completed in 4.204ms
>         offset 0 in1: 0.68207 in2: 0.681641
>         offset 2 in1: 0.892081 in2: 0.890625
>         offset 3 in1: 0.61199 in2: 0.609375
>         offset 6 in1: 0.406933 in2: 0.405273
>         offset 7 in1: 0.771893 in2: 0.769531
>         offset 9 in1: 0.839927 in2: 0.837891
>         offset 10 in1: 0.833134 in2: 0.832031
>         offset 11 in1: 0.844083 in2: 0.84375
>         offset 13 in1: 0.990661 in2: 0.990234
>         offset 14 in1: 0.879374 in2: 0.878906
>         volk_32f_sqrt_32f: fail on arch neon
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         Failure on volk_32f_sqrt_32f
>         RUN_VOLK_TESTS: volk_32f_s32f_stddev_32f(131071,1)
>         generic completed in 0.419ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_stddev_and_mean_32f_x2(131071,1)
>         generic completed in 0.441ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_subtract_32f(131071,1)
>         generic completed in 0.645ms
>         neon completed in 0.613ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32f_x3_sum_of_poly_32f(131071,1)
>         generic completed in 1.544ms
>         a_neon completed in 1.059ms
>         neonvert completed in 0.54ms
>         Best aligned arch: neonvert
>         Best unaligned arch: neonvert
>         RUN_VOLK_TESTS: volk_32i_x2_and_32i(131071,1)
>         neon completed in 0.62ms
>         generic completed in 0.588ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32i_s32f_convert_32f(131071,1)
>         generic completed in 0.432ms
>         a_generic completed in 0.445ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32i_x2_or_32i(131071,1)
>         neon completed in 0.728ms
>         generic completed in 0.7ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32f_x2_dot_prod_16i(131071,1)
>         generic completed in 0.54ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_64f_convert_32f(131071,1)
>         generic completed in 0.829ms
>         a_generic completed in 0.695ms
>         Best aligned arch: a_generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_64f_x2_max_64f(131071,1)
>         generic completed in 1.368ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_64f_x2_min_64f(131071,1)
>         generic completed in 1.371ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_8ic_deinterleave_16i_x2(131071,1)
>         generic completed in 0.419ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_8ic_s32f_deinterleave_32f_x2(131071,1)
>         generic completed in 0.686ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_8ic_deinterleave_real_16i(131071,1)
>         generic completed in 0.31ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_8ic_s32f_deinterleave_real_32f(131071,1)
>         generic completed in 0.409ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_8ic_deinterleave_real_8i(131071,1)
>         generic completed in 0.173ms
>         neon completed in 0.157ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_8ic_x2_multiply_conjugate_16ic(131071,1)
>         generic completed in 2.468ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS:
>         volk_8ic_x2_s32f_multiply_conjugate_32fc(131071,1)
>         generic completed in 2.21ms
>         Best aligned arch: generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_8i_convert_16i(131071,1)
>         generic completed in 0.137ms
>         a_generic completed in 0.112ms
>         neon completed in 0.115ms
>         Best aligned arch: a_generic
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_8i_s32f_convert_32f(131071,1)
>         generic completed in 0.187ms
>         a_generic completed in 0.179ms
>         Best aligned arch: a_generic
>         Best unaligned arch: generic
>         RUN_VOLK_TESTS: volk_32fc_s32fc_multiply_32fc(131071,1)
>         generic completed in 1.461ms
>         neon completed in 0.747ms
>         a_generic completed in 1.447ms
>         offset 0 in1: 182.179 + 36.3883j  in2: 146.351 + -160.164j
>         offset 1 in1: -60.14 + -277.288j  in2: -195.427 + -81.677j
>         offset 2 in1: 268.287 + 258.704j  in2: 205.207 + 82.4777j
>         offset 3 in1: 254.856 + 307.423j  in2: 0 + 0j
>         offset 5 in1: -277.526 + -318.87j  in2: -596.396 + -596.396j
>         offset 6 in1: -5.68741 + 178.404j  in2: -1.73927e-08 +
>         5.45578e-07j
>         offset 7 in1: -279.86 + 44.2968j  in2: 0 + 0j
>         offset 9 in1: -262.193 + -123.933j  in2: -386.127 + -386.127j
>         offset 10 in1: 57.8619 + -289.394j  in2: 1.76948e-07 +
>         -8.84997e-07j
>         offset 11 in1: -23.4022 + 307.941j  in2: 0 + 0j
>         volk_32fc_s32fc_multiply_32fc: fail on arch neon
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         Failure on volk_32fc_s32fc_multiply_32fc
>         RUN_VOLK_TESTS: volk_32f_s32f_multiply_32f(131071,1)
>         generic completed in 0.444ms
>         u_neon completed in 0.361ms
>         a_generic completed in 0.349ms
>         Best aligned arch: a_generic
>         Best unaligned arch: u_neon
>         RUN_VOLK_TESTS: volk_32f_binary_slicer_32i(131071,1)
>         generic completed in 0.538ms
>         generic_branchless completed in 0.532ms
>         Best aligned arch: generic_branchless
>         Best unaligned arch: generic_branchless
>         RUN_VOLK_TESTS: volk_32f_binary_slicer_8i(131071,1)
>         generic completed in 0.531ms
>         generic_branchless completed in 0.519ms
>         neon completed in 0.324ms
>         Best aligned arch: neon
>         Best unaligned arch: neon
>         RUN_VOLK_TESTS: volk_32f_tanh_32f(131071,1)
>         generic completed in 12.998ms
>         series completed in 1.83ms
>         Best aligned arch: series
>         Best unaligned arch: series
>         Kernel QA finished: 3 failures out of 91 tests.
>         The following kernels failed QA:
>             volk_32fc_magnitude_32f
>             volk_32f_sqrt_32f
>             volk_32fc_s32fc_multiply_32fc
>         
>         
>         
>         
>         
>         
>         
>         _______________________________________________
>         Discuss-gnuradio mailing list
>         address@hidden
>         https://lists.gnu.org/mailman/listinfo/discuss-gnuradio
> 
> 





reply via email to

[Prev in Thread] Current Thread [Next in Thread]