I don't see a good way to scan assembly output for this optimization, so I've just added the following testcase based on scanning the dump file. Better ideas are welcome.

Bernd