public inbox for gcc@gcc.gnu.org
* Re: Inefficient code
From: Bernd Edlinger @ 2018-07-06 10:18 UTC
  To: Paul Koning; +Cc: gcc

You can get much better code if you make xrci a bit-field as well,
so that the entire bit-field region can be accessed word-wise:


#include <stdint.h>

struct Xrb
{
    uint16_t xrlen;             /* Length of I/O buffer in bytes */
    uint16_t xrbc;              /* Byte count for transfer */
    void * xrloc;               /* Pointer to I/O buffer */
    uint8_t xrci:8;             /* Channel number times 2 for transfer */
    uint32_t xrblk:24;          /* Random access block number */
    uint16_t xrtime;            /* Wait time for terminal input */
    uint16_t xrmod;             /* Modifiers */
};

void test(struct Xrb *XRB)
{
    XRB->xrblk = 5;
}
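
A quick host-side check that the change is source-compatible might look like the sketch below. It repeats the struct so that it compiles on its own; set_block is a hypothetical helper, not something from the original code. Note that uint8_t as a bit-field type is implementation-defined in ISO C (GCC accepts it), and that the check only exercises the values, not the generated code or the layout on the 16-bit target.

```c
#include <stdint.h>

/* Same declaration as above, repeated so this block is self-contained. */
struct Xrb
{
    uint16_t xrlen;             /* Length of I/O buffer in bytes */
    uint16_t xrbc;              /* Byte count for transfer */
    void * xrloc;               /* Pointer to I/O buffer */
    uint8_t xrci:8;             /* Channel number times 2 for transfer */
    uint32_t xrblk:24;          /* Random access block number */
    uint16_t xrtime;            /* Wait time for terminal input */
    uint16_t xrmod;             /* Modifiers */
};

/* Hypothetical helper: store a block number without disturbing xrci. */
void set_block(struct Xrb *XRB, uint32_t blk)
{
    XRB->xrblk = blk;           /* only the low 24 bits are kept */
}
```

After set_block(&x, 5), x.xrblk should read back as 5 and x.xrci and x.xrtime should be unchanged; a value wider than 24 bits is reduced modulo 2^24, as for any unsigned bit-field.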


Bernd.

* Inefficient code
From: Paul Koning @ 2018-07-05 12:46 UTC
  To: GCC Development

I have a struct that looks like this:

struct Xrb
{
    uint16_t xrlen;		/* Length of I/O buffer in bytes */
    uint16_t xrbc;		/* Byte count for transfer */
    void * xrloc;		/* Pointer to I/O buffer */
    uint8_t xrci;		/* Channel number times 2 for transfer */
    uint32_t xrblk:24;	/* Random access block number */
    uint16_t xrtime;	/* Wait time for terminal input */
    uint16_t xrmod;		/* Modifiers */
};

When I write to xrblk (that 24-bit field) on my 16-bit target, I get unexpectedly inefficient output:

    XRB->xrblk = 5;

	movb	#5,10(r0)
	clrb	11(r0)
	clrb	7(r0)

rather than the expected word write to the word-aligned lower half of that field.
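
To make the comparison concrete, here is a rough model of the two store sequences on a plain byte buffer. It assumes the displacements in the listing are octal (so 10 and 11 are byte offsets 8 and 9), and it infers from the listing that byte 7 holds the high 8 bits of the field and that the word at offset 8 is stored low byte first. The names BLK_HI, BLK_LO, store_bytes and store_word are made up for illustration.

```c
#include <stdint.h>
#include <string.h>

/* Byte offsets of xrblk inside the struct, inferred from the listing. */
enum { BLK_HI = 7, BLK_LO = 8 };

/* What was emitted: three separate byte stores. */
void store_bytes(uint8_t *xrb, uint32_t blk)
{
    xrb[BLK_LO]     = (uint8_t)(blk & 0xff);          /* movb #5,10(r0) */
    xrb[BLK_LO + 1] = (uint8_t)((blk >> 8) & 0xff);   /* clrb 11(r0) */
    xrb[BLK_HI]     = (uint8_t)((blk >> 16) & 0xff);  /* clrb 7(r0) */
}

/* What was expected: one aligned word store plus one byte store. */
void store_word(uint8_t *xrb, uint32_t blk)
{
    /* Low 16 bits laid out low byte first (assumed word byte order). */
    uint8_t word[2] = { (uint8_t)(blk & 0xff), (uint8_t)((blk >> 8) & 0xff) };

    memcpy(&xrb[BLK_LO], word, sizeof word);          /* stands in for a single word move to offset 8 */
    xrb[BLK_HI] = (uint8_t)((blk >> 16) & 0xff);      /* byte store to offset 7 */
}
```

Both functions leave the buffer in the same state and touch only offsets 7 to 9; the difference is purely in the number of stores, which is the point of the complaint.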

Looking at the dumps, I see it coming into the RTL expand phase as a single write, which expand then turns into the three insns corresponding to the above.  But (of course) there is a word (HImode) move also, which has the same cost as the byte one.

Is there something I have to do in my target definition to get this to come out right?  This is a strict_alignment target, but alignment is satisfied in this example.  Also, SLOW_BYTE_ACCESS is 1.

	paul



