From mboxrd@z Thu Jan 1 00:00:00 1970 From: Martin Kahlert To: axp-list@redhat.com Cc: scr@iis.fhg.de, robert@physiol.med.tu-muenchen.de, egcs@cygnus.com Subject: Re: Performance measurements (thanks and conclusion) Date: Thu, 25 Jun 1998 03:09:00 -0000 Message-id: <19980625094618.A26562@keksy.mchp.siemens.de> References: <199806240851.KAA06049@keksy.mchp.siemens.de> <3590D5AE.167EB0E7@iis.fhg.de> <19980624124843.A15248@keksy.mchp.siemens.de> <3591031A.2781E494@iis.fhg.de> <19980624170051.21290@haegar.physiol.med.tu-muenchen.de> X-SW-Source: 1998-06/msg00893.html Quoting Robert Wilhelm (robert@physiol.med.tu-muenchen.de): > > [Robert: could you please compile my code on your Alpha using > > egcs and report your results to me] > > I get about 275 MFLOPS for my 533MHz 21164a for both egcs 1.0 and > egcs-current with haifa enabled. > > If I use different local variables lfA*, egcs seems to shedule a bit > better and I get 290 MFLOPS. > > Robert I was really overwhelmed with the repsonse to this thread on axp-list. Thanks a lot for all people who tried my source and even tried to get more out of the compilers. I tried both versions on my PPro 200: Stefan Schroepfer's version: pgcc: 85.98 MFLOPS gcc-2.7.2.1: 97.10 MFLOPS gcc-without double align: 95.46 MFLOPS egcs-2.91.42: 84.06 MFLOPS tcc: 17.20 MFLOPS Robert Wilhelm's version: pgcc: 81.62 MFLOPS gcc-2.7.2.1: 98.81 MFLOPS gcc-without double align: 98.81 MFLOPS egcs-2.91.42: 83.44 MFLOPS tcc: 16.44 MFLOPS It seems that tcc is not the fastest and the most reliable under the sun... Can i conclude, that it's a good idea to insert as many local vars as possible to get good results from compilers? Now i have two questions: -Why is it so difficult for gcc to transform the code for(i=0;i