octave-maintainers
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: conv2 performance


From: Jaroslav Hajek
Subject: Re: conv2 performance
Date: Wed, 3 Mar 2010 21:15:47 +0100

On Wed, Mar 3, 2010 at 7:28 PM, Michael D. Godfrey
<address@hidden> wrote:
> Jaroslav,
>
> Here is what I got for 3 runs using the matlab shown
> below, and following are 3 runs on the same machine
> (Linux qss.stanford.edu 2.6.31.12-174.2.3.fc12.x86_64 #1 SMP Mon Jan 18
> 19:52:07 UTC 2010 x86_64 x86_64 x86_64 GNU/Linux)
> using your latest patches as of today (3 March).
>

Incl. this one, I suppose:
http://hg.savannah.gnu.org/hgweb/octave/rev/5af0b4bb384d

> looks good!!
>
>>> version
>
> ans =
>
> 7.9.0.529 (R2009b)
>
>>> tocs1
>
> tocs1 =
>
>    0.0276    0.0376    0.0370    0.0403    0.0465    0.0541    0.0717
>  0.0713    0.0774    0.0851
>    0.0171    0.0304    0.0390    0.0459    0.0558    0.0647    0.0694
>  0.0815    0.0890    0.0926
>    0.0256    0.0304    0.0498    0.0555    0.0519    0.0757    0.0796
>  0.0855    0.1098    0.0955
>    0.0206    0.0435    0.0410    0.0456    0.0577    0.0731    0.0768
>  0.0858    0.0979    0.1039
>    0.0233    0.0327    0.0429    0.0850    0.0843    0.0820    0.0864
>  0.0930    0.1106    0.1186
>    0.0196    0.0399    0.0468    0.0544    0.0703    0.0829    0.0914
>  0.1060    0.1189    0.1447
>    0.0255    0.0403    0.0525    0.0652    0.0803    0.0916    0.1039
>  0.1119    0.1313    0.1442
>    0.0227    0.0404    0.0661    0.0675    0.0826    0.0996    0.1085
>  0.1263    0.1413    0.1519
>    0.0293    0.0374    0.0585    0.0753    0.0892    0.1061    0.1205
>  0.1337    0.1545    0.1747
>    0.0244    0.0455    0.0624    0.0748    0.0962    0.1151    0.1263
>  0.1457    0.1645    0.1767
>
>>> tocs2
>
> tocs2 =
>
>    0.0215    0.0215    0.0357    0.0302    0.0466    0.0481    0.0565
>  0.0631    0.0734    0.0633
>    0.0197    0.0198    0.0328    0.0341    0.0460    0.0473    0.0595
>  0.0608    0.0733    0.0741
>    0.0201    0.0223    0.0364    0.0390    0.0528    0.0554    0.0694
>  0.0866    0.0961    0.0877
>    0.0211    0.0245    0.0404    0.0439    0.0591    0.0622    0.0776
>  0.0820    0.0994    0.0998
>    0.0222    0.0294    0.0440    0.0484    0.0651    0.0694    0.0866
>  0.0914    0.1080    0.1133
>    0.0237    0.0294    0.0482    0.0538    0.0724    0.0772    0.0967
>  0.1015    0.1192    0.1246
>    0.0247    0.0316    0.0515    0.0582    0.0779    0.0857    0.1043
>  0.1111    0.1308    0.1375
>    0.0260    0.0347    0.0578    0.0629    0.0840    0.0918    0.1267
>  0.1214    0.1430    0.1502
>    0.0295    0.0379    0.0587    0.0711    0.0904    0.0999    0.1204
>  0.1313    0.1522    0.1644
>    0.0285    0.0394    0.0637    0.0729    0.1022    0.1074    0.1308
>  0.1414    0.1654    0.1748
>
>>> tocs3
>
> tocs3 =
>
>    0.0102    0.0191    0.0217    0.0359    0.0333    0.0373    0.0495
>  0.0498    0.0539    0.0659
>    0.0120    0.0172    0.0309    0.0322    0.0371    0.0510    0.0522
>  0.0584    0.0721    0.0738
>    0.0115    0.0196    0.0279    0.0449    0.0479    0.0527    0.0686
>  0.0710    0.0841    0.0873
>    0.0124    0.0226    0.0317    0.0489    0.0578    0.0634    0.0808
>  0.0813    0.0965    0.0988
>    0.0136    0.0242    0.0352    0.0541    0.0577    0.0673    0.0859
>  0.0900    0.1081    0.1271
>    0.0147    0.0268    0.0399    0.0591    0.0653    0.0835    0.0886
>  0.1066    0.1118    0.1299
>    0.0175    0.0293    0.0509    0.0575    0.0772    0.0840    0.1038
>  0.1100    0.1294    0.1363
>    0.0253    0.0341    0.0460    0.0685    0.0765    0.0974    0.1052
>  0.1261    0.1340    0.1554
>    0.0202    0.0361    0.0583    0.0679    0.0908    0.1002    0.1222
>  0.1306    0.1523    0.1619
>    0.0278    0.0381    0.0617    0.0720    0.0956    0.1059    0.1294
>  0.1396    0.1631    0.1735
>
>>>
>
> octave:8> version
> ans = 3.3.50+
> octave:9> tocs1
> tocs1 =
>
>   0.019214   0.010567   0.013155   0.019712   0.015009   0.016192   0.020595
>   0.021002   0.023537   0.027402
>   0.012689   0.020129   0.022437   0.016747   0.021009   0.024248   0.025196
>   0.029792   0.034801   0.037593
>   0.011220   0.016862   0.024116   0.029527   0.025354   0.031000   0.035506
>   0.039730   0.043879   0.048756
>   0.015511   0.022897   0.028157   0.027788   0.031833   0.037788   0.043802
>   0.047803   0.054861   0.059986
>   0.013858   0.021702   0.031242   0.039183   0.037245   0.044698   0.051554
>   0.057888   0.065361   0.072435
>   0.017494   0.028116   0.035032   0.037109   0.043360   0.051838   0.059653
>   0.066827   0.076396   0.086455
>   0.022556   0.023629   0.031844   0.048084   0.048953   0.058864   0.068378
>   0.076799   0.086576   0.096042
>   0.019764   0.032976   0.042544   0.046571   0.054857   0.066023   0.076772
>   0.085750   0.097632   0.110055
>   0.025015   0.028970   0.039292   0.057634   0.060709   0.072690   0.084883
>   0.095403   0.107778   0.119558
>   0.022460   0.037977   0.048971   0.055805   0.066637   0.080303   0.092977
>   0.104676   0.118947   0.133914
>
> octave:10> tocs2
> tocs2 =
>
>   0.0089766   0.0089146   0.0099924   0.0121259   0.0124310   0.0138649
> 0.0158290   0.0192552   0.0206140   0.0254322
>   0.0094038   0.0114406   0.0138560   0.0165397   0.0188648   0.0209138
> 0.0242604   0.0294096   0.0320014   0.0366734
>   0.0094929   0.0140752   0.0179712   0.0210668   0.0250846   0.0287987
> 0.0328435   0.0391932   0.0428656   0.0478950
>   0.0108313   0.0167624   0.0213766   0.0261055   0.0306261   0.0359392
> 0.0416096   0.0477028   0.0532427   0.0595880
>   0.0120316   0.0191212   0.0249160   0.0306268   0.0366570   0.0430312
> 0.0497590   0.0568288   0.0635993   0.0714330
>   0.0134106   0.0213762   0.0281576   0.0358360   0.0423596   0.0499462
> 0.0577566   0.0665130   0.0742680   0.0834634
>   0.0144182   0.0235894   0.0318044   0.0401079   0.0481882   0.0569076
> 0.0661522   0.0759924   0.0852475   0.0952354
>   0.0156541   0.0259586   0.0353074   0.0446630   0.0541522   0.0642590
> 0.0746447   0.0852551   0.0957564   0.1069349
>   0.0168509   0.0283590   0.0388630   0.0494585   0.0602769   0.0712918
> 0.0828704   0.0947692   0.1063678   0.1188150
>   0.0181142   0.0306156   0.0423775   0.0542473   0.0661732   0.0782689
> 0.0911586   0.1041530   0.1170628   0.1323286
>
> octave:11> tocs3
> tocs3 =
>
>   0.0120083   0.0176144   0.0194798   0.0114374   0.0130204   0.0137780
> 0.0162830   0.0170316   0.0202717   0.0253530
>   0.0083260   0.0118194   0.0139199   0.0164822   0.0185746   0.0221460
> 0.0254679   0.0295946   0.0316750   0.0353434
>   0.0098534   0.0143854   0.0175584   0.0215424   0.0250006   0.0287922
> 0.0342940   0.0380244   0.0422662   0.0475070
>   0.0109296   0.0166884   0.0215658   0.0259268   0.0306405   0.0358992
> 0.0418940   0.0472311   0.0532780   0.0597278
>   0.0119736   0.0190616   0.0247258   0.0306218   0.0367832   0.0428558
> 0.0497826   0.0567114   0.0635888   0.0713416
>   0.0132472   0.0214322   0.0281730   0.0356503   0.0424460   0.0500294
> 0.0582558   0.0664224   0.0742614   0.0831792
>   0.0177300   0.0319474   0.0401710   0.0400095   0.0483465   0.0570172
> 0.0667727   0.0755937   0.0849328   0.0981958
>   0.0238613   0.0262291   0.0353556   0.0536186   0.0542682   0.0643420
> 0.0747376   0.0851263   0.0956281   0.1070604
>   0.0200213   0.0369326   0.0481106   0.0494369   0.0601038   0.0713198
> 0.0827417   0.0948833   0.1063394   0.1222183
>   0.0263478   0.0313474   0.0427486   0.0624546   0.0659198   0.0782790
> 0.0913004   0.1041054   0.1168824   0.1304600
>
> octave:12>
>
>
> And, just for the record, cpuinfo is:
>
> [qss:octave] cat /proc/cpuinfo
> processor    : 0
> vendor_id    : AuthenticAMD
> cpu family    : 15
> model        : 75
> model name    : AMD Athlon(tm) 64 X2 Dual Core Processor 4200+
> stepping    : 2
> cpu MHz        : 1000.000
> cache size    : 512 KB
> physical id    : 0
> siblings    : 2
> core id        : 0
> cpu cores    : 2
> apicid        : 0
> initial apicid    : 0
> fpu        : yes
> fpu_exception    : yes
> cpuid level    : 1
> wp        : yes
> flags        : fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca cmov pat
> pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm
> 3dnowext 3dnow rep_good extd_apicid pni cx16 lahf_lm cmp_legacy svm extapic
> cr8_legacy
> bogomips    : 2009.24
> TLB size    : 1024 4K pages
> clflush size    : 64
> cache_alignment    : 64
> address sizes    : 40 bits physical, 48 bits virtual
> power management: ts fid vid ttp tm stc
>
> processor    : 1
> vendor_id    : AuthenticAMD
> cpu family    : 15
> model        : 75
> model name    : AMD Athlon(tm) 64 X2 Dual Core Processor 4200+
> stepping    : 2
> cpu MHz        : 1000.000
> cache size    : 512 KB
> physical id    : 0
> siblings    : 2
> core id        : 1
> cpu cores    : 2
> apicid        : 1
> initial apicid    : 1
> fpu        : yes
> fpu_exception    : yes
> cpuid level    : 1
> wp        : yes
> flags        : fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca cmov pat
> pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm
> 3dnowext 3dnow rep_good extd_apicid pni cx16 lahf_lm cmp_legacy svm extapic
> cr8_legacy
> bogomips    : 2009.24
> TLB size    : 1024 4K pages
> clflush size    : 64
> cache_alignment    : 64
> address sizes    : 40 bits physical, 48 bits virtual
> power management: ts fid vid ttp tm stc
>
>
>

Interesting. May I also inquire what BLAS you use? Is it ATLAS or
something else?


-- 
RNDr. Jaroslav Hajek, PhD
computing expert & GNU Octave developer
Aeronautical Research and Test Institute (VZLU)
Prague, Czech Republic
url: www.highegg.matfyz.cz



reply via email to

[Prev in Thread] Current Thread [Next in Thread]