Optimiziation questions (c++)

public inbox for gcc-help@gcc.gnu.org
 help / color / mirror / Atom feed

* Optimiziation questions (c++)
@ 2007-12-06 17:59 NightStrike
  2007-12-06 20:31 ` Brian Dessent
  2008-02-07  3:41 ` NightStrike
  0 siblings, 2 replies; 4+ messages in thread
From: NightStrike @ 2007-12-06 17:59 UTC (permalink / raw)
  To: gcc-help

(1)
I am curious about a certain aspect of how the g++ optimizer works at
-O3.  If I have a for-loop, there are times when it is beneficial to
have a function call in the while-condition.  For instance:

vector<int> v;
...
for ( int i=1; i < v.size(); ++i)

Now, I if v is not changing inside of that for-loop, then this would
appear at first glance to be more efficient:

int max = v.size();
for ( int i=1; i < max; ++i)

By using 'max' instead of v.size(), I am eliminating a function call
from every iteration of the for loop.  If the loop is iterated a large
number of times, this could have an advantage.  However, is it all the
same when you are optimizing?  Does g++ remove the need for the second
method?

Before I end this email, I am sure that at least someone is thinking,
"Wait, you're using a vector, and using an old style for loop to
traverse it instead of iterators!"  Well, yes, that's true.  But I'm
doing that for two reasons -- 1) I'm interested in the question as it
stands, so I asked it the way I did, and 2) My next question will
already cover that, as it involves a comparison between traversing an
array as I do above and using iterators.

(2)
Is there a big difference in speed between using a standard for-loop
for vector traversal and using iterators?  I already verified that I
can get a significant speed improvement (with very large N) by
accessing the vector directory with v[i] instead of using the at()
method.  But what about iterators?  If I write it with iterators
instead, will that be optimized out at -O3 to reduce to essentially
the same thing?

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Optimiziation questions (c++)
  2007-12-06 17:59 Optimiziation questions (c++) NightStrike
@ 2007-12-06 20:31 ` Brian Dessent
  2008-02-07  3:41 ` NightStrike
  1 sibling, 0 replies; 4+ messages in thread
From: Brian Dessent @ 2007-12-06 20:31 UTC (permalink / raw)
  To: NightStrike; +Cc: gcc-help

NightStrike wrote:

> By using 'max' instead of v.size(), I am eliminating a function call
> from every iteration of the for loop.  If the loop is iterated a large
> number of times, this could have an advantage.  However, is it all the
> same when you are optimizing?  Does g++ remove the need for the second
> method?

The compiler can hoist the condition out of the loop, but only if it can
determine that v is not modified inside the loop.  It might know this if
it can determine that all function calls inside the loop are pure, or if
v is a local auto variable that's not had its address taken.  There
could be other ways it can prove this, I don't know.  It's hard to
really say much in the abstract about what the compiler might or might
not do, because so much depends on the details of the code and the
information that the compiler has on hand.  A testcase is much more
valuable because you can look at the -S output and see for sure.

Brian

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Optimiziation questions (c++)
  2007-12-06 17:59 Optimiziation questions (c++) NightStrike
  2007-12-06 20:31 ` Brian Dessent
@ 2008-02-07  3:41 ` NightStrike
  2008-02-07 16:01   ` Tony Wetmore
  1 sibling, 1 reply; 4+ messages in thread
From: NightStrike @ 2008-02-07  3:41 UTC (permalink / raw)
  To: gcc-help

On 12/6/07, NightStrike <nightstrike@gmail.com> wrote:
> (1)
> I am curious about a certain aspect of how the g++ optimizer works at
> -O3.  If I have a for-loop, there are times when it is beneficial to
> have a function call in the while-condition.  For instance:
>
> vector<int> v;
> ...
> for ( int i=1; i < v.size(); ++i)
>
> Now, I if v is not changing inside of that for-loop, then this would
> appear at first glance to be more efficient:
>
> int max = v.size();
> for ( int i=1; i < max; ++i)
>
> By using 'max' instead of v.size(), I am eliminating a function call
> from every iteration of the for loop.  If the loop is iterated a large
> number of times, this could have an advantage.  However, is it all the
> same when you are optimizing?  Does g++ remove the need for the second
> method?
>
>
> Before I end this email, I am sure that at least someone is thinking,
> "Wait, you're using a vector, and using an old style for loop to
> traverse it instead of iterators!"  Well, yes, that's true.  But I'm
> doing that for two reasons -- 1) I'm interested in the question as it
> stands, so I asked it the way I did, and 2) My next question will
> already cover that, as it involves a comparison between traversing an
> array as I do above and using iterators.
>
>
> (2)
> Is there a big difference in speed between using a standard for-loop
> for vector traversal and using iterators?  I already verified that I
> can get a significant speed improvement (with very large N) by
> accessing the vector directory with v[i] instead of using the at()
> method.  But what about iterators?  If I write it with iterators
> instead, will that be optimized out at -O3 to reduce to essentially
> the same thing?
>


I received feedback on #1.  Any ideas on #2 from this email?

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Optimiziation questions (c++)
  2008-02-07  3:41 ` NightStrike
@ 2008-02-07 16:01   ` Tony Wetmore
  0 siblings, 0 replies; 4+ messages in thread
From: Tony Wetmore @ 2008-02-07 16:01 UTC (permalink / raw)
  To: NightStrike, GCC Help Mailing List

NightStrike wrote:
 >> (2)
 >> Is there a big difference in speed between using a standard for-loop
 >> for vector traversal and using iterators?  I already verified that I
 >> can get a significant speed improvement (with very large N) by
 >> accessing the vector directory with v[i] instead of using the at()
 >> method.  But what about iterators?  If I write it with iterators
 >> instead, will that be optimized out at -O3 to reduce to essentially
 >> the same thing?
 >
 > I received feedback on #1.  Any ideas on #2 from this email?

Since you appear to already have a test case to measure and compare 
performance, why not simply do the investigation yourself?

I am not a GCC internals expert, but the at() method *should* be 
expected to perform much slower than direct indexing.  I believe the 
at() method is required to bounds-check every access to the vector, and 
throw an exception, while direct indexing is not.

As for iterator-loop optimization vs. for-loop optimization, that likely 
depends on whether the vector::size() method is inlined and the call 
optimized away by GCC.  In the iterator-loop case, it will depend on 
whether the vector::end() method is similarly optimized away, since that 
will need to be called for every loop iteration, just as vector::size() 
would be called in the for-loop.

I think an easy way to determine the precise optimization would be to 
look at the assembly code generated by GCC.  I would expect a very 
simple test case to result in a very manageable amount of assembly code 
to analyze.

I wrote a simple program to do just that and the optimized version using 
iterators would appear to be slightly more efficient to my untrained 
eyes, but that might depend on the exact cost of individual assembly 
instructions on your architecture.  In the -O3 case, each function 
contains approximately 20 lines of assembly code.

I cannot say what sort of improvement, if any, that might provide over a 
large data set, since I did not measure performance.

Here is the program I wrote:

#include <cstdio>
#include <vector>
void printForLoop( std::vector<int>& intVec )
{
   for( size_t i = 0; i < intVec.size(); ++i )
     printf( "%d\n", intVec[i] );
}
void printIterator( std::vector<int>& intVec )
{
   for( std::vector<int>::iterator it = intVec.begin(); it != 
intVec.end(); ++it )
     printf( "%d\n", *it );
}
int main()
{
   std::vector<int> intVec;
   for( size_t i = 0; i < 10; ++i )
     intVec.push_back( i );
   printForLoop( intVec );
   printIterator( intVec );
}

I examined the assembly code by disassembling the two functions in GDB. 
  When I compiled with -O3, neither function made a call to check the 
loop boundary conditions.  That is, both vector.size() and vector.end() 
were optimized away by GCC.

You could instead have GCC save the assembly code to a file, but loading 
the code into GDB seemed just as easy.  Either way, I highly recommend 
examining the generated assembly code yourself, as it can be quite 
instructive.

--
Tony Wetmore

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2008-02-07 16:01 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-12-06 17:59 Optimiziation questions (c++) NightStrike
2007-12-06 20:31 ` Brian Dessent
2008-02-07  3:41 ` NightStrike
2008-02-07 16:01   ` Tony Wetmore

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).