Search

Technical Discussion Group Forum

This forum is provided for user discussion. While Beacon EmbeddedWorks support staff and engineers participate, Beacon EmbeddedWorks does not guarantee the accuracy of all information within in the Technical Discussion Group (TDG).

The "Articles" forums provide brief Articles written by Beacon EmbeddedWorks engineers that address the most frequently asked technical questions.

To receive email notifications when updates are posted for a Beacon EmbeddedWorks product download, please subscribe to the TDG Forum of interest.

TDG Forum

PrevPrev Go to previous topic
NextNext Go to next topic
Last Post 26 Feb 2013 04:36 AM by  jduran@gem-med.com
BSP 2.2: C6runlib performance.
 2 Replies
Sort:
You are not authorized to post a reply.
Author Messages
jduran@gem-med.com
New Member
New Member
Posts:79


--
21 Nov 2012 04:01 AM
    Dear all,

    As in the BSP 2.2 it is possible to build de the C6Run package, we've started to run the C6Runlib
    examples, to compare the performance of ARM vs DSP in calculations. Expecting to see better
    performance when calculations are done by DSP than ARM, we've found the oposite.

    Following there are the results of the execution of cfft example program:

    Quote:

    DM-37x# ./cfft_arm
    N=16,nTimes=100: 0.001007 s
    N=32,nTimes=100: 0.002045 s
    N=64,nTimes=100: 0.004791 s
    N=128,nTimes=100: 0.010772 s
    N=256,nTimes=100: 0.024628 s
    N=512,nTimes=100: 0.055725 s
    N=1024,nTimes=100: 0.124604 s
    N=2048,nTimes=100: 0.272583 s
    N=4096,nTimes=100: 0.596222 s
    N=8192,nTimes=100: 1.29132 s
    N=16384,nTimes=100: 2.74329 s
    DM-37x# ./cfft_dsp
    N=16,nTimes=100: 0.126648 s
    N=32,nTimes=100: 0.14206 s
    N=64,nTimes=100: 0.177978 s
    N=128,nTimes=100: 0.260376 s
    N=256,nTimes=100: 0.451263 s
    N=512,nTimes=100: 0.872955 s
    N=1024,nTimes=100: 1.81073 s
    N=2048,nTimes=100: 3.87048 s
    N=4096,nTimes=100: 8.3595 s
    N=8192,nTimes=100: 18.1759 s
    N=16384,nTimes=100: 39.5378 s
    Segmentation fault


    Thinking that in BSP, no NEON nor CortexA8 optimization options are provided, it is strange that DSP performance is worse than ARM performance. Do you get the same behavior in your tests?

    Thanks and Best Regards,
    Joaquim Duran
    jduran@gem-med.com
    New Member
    New Member
    Posts:79


    --
    14 Feb 2013 05:01 AM
    Any news with this issue?

    Joaquim Duran
    jduran@gem-med.com
    New Member
    New Member
    Posts:79


    --
    26 Feb 2013 04:36 AM
    You are not authorized to post a reply.