Linux kernel problem -- food for thoughts

public inbox for gdb@sourceware.org
 help / color / mirror / Atom feed

* Linux kernel problem -- food for thoughts
@ 2003-04-16 14:19 Elena Zannoni
  2003-04-16 14:28 ` Daniel Jacobowitz
  2003-04-16 20:51 ` Roland McGrath
  0 siblings, 2 replies; 11+ messages in thread
From: Elena Zannoni @ 2003-04-16 14:19 UTC (permalink / raw)
  To: gdb; +Cc: roland, drow

Gdb is currently having a 'little problem' backtracing out of system
calls in x86 kernels which support NPTL. I think the current public
2.5 kernel would make this problem show up.

Right now, if you are in system calls the backtrace will show up as:

 0xffffe002 in ??

Here is an explanation of the problem that Roland has provided:

---------------
Previously asm or C code in libc entered the kernel by setting some
registers and using the "int $0x80" instruction.  e.g.

00000000 <__getpid>:
   0:	b8 14 00 00 00       	mov    $0x14,%eax
   5:	cd 80                	int    $0x80
   7:	c3                   	ret    

That is the function called __getpid in libc, the pre-NPTL build.  (In the
shared library you will see this if you've run with LD_ASSUME_KERNEL=2.4.1
so that /lib/i686/libc.so.6 is what you are using.)

In the new libc (/lib/tls/libc.so.6), that function looks like this:

00000000 <__getpid>:
   0:	b8 14 00 00 00       	mov    $0x14,%eax
   5:	65 ff 15 10 00 00 00 	call   *%gs:0x10
   c:	c3                   	ret    

%gs:0x10 is a location that has been initialized to a kernel-supplied
special entry point address.  In the current kernels, that address is
always 0xffffe000.  But that is not part of the ABI, which is why it's
indirect instead of a literal "call 0xffffe000".  The kernel supplies the
actual entry point address to libc at startup time, and nothing in the
kernel-user interface prevents it from using a different address in each
process if it chose to.

The reason for this is that there can be multiple ways to enter the kernel,
not just the "int $0x80" trap instruction.  Some kernels on some hardware
may use a different method that performs better.  By using this
kernel-supplied entry point address, no user code has to be changed to
select the method.  It's entirely the kernel's choice.

In all the RH kernels we have right now, the entry point page contains:

	0xffffe000:	int $0x80
	0xffffe002:	ret

But user code cannot presume what this code sequence looks like exactly.
It will be some sequence of register and stack moves and special trap
instructions, but you have to disassemble to know exactly.  In the case
above, the PC value seen while a thread is in the kernel is 0xffffe002.
You can disassemble the "ret" there and see that you have to pop the PC off
the stack to recover the caller's frame.  

Another example of what this code might look like when you disassemble it is:

	0xffffe000:	push   %ecx
	0xffffe001:	push   %edx
	0xffffe002:	push   %ebp
	0xffffe003: 	mov    %esp,%ebp
	0xffffe005: 	sysenter 
	0xffffe007:	nop    
	0xffffe008:	nop    
	0xffffe009:	nop    
	0xffffe00a:	nop    
	0xffffe00b:	nop    
	0xffffe00c:	nop    
	0xffffe00d:	nop    
	0xffffe00e: 	jmp    0xffffe003
	0xffffe010:	pop    %ebp
	0xffffe011:	pop    %edx
	0xffffe012:	pop    %ecx
	0xffffe013:	ret    

In this example, depending on what happened inside the kernel the PC you
usually see may be either 0xffffe00e or 0xffffe010.  If the process gets a
signal or you attach asynchronously or so forth, the PC might be at any of
the earlier instructions as well.  You cannot rely on exactly what the
sequence is, so you must be able to disassemble from where you are and
cope.  In this case you will most often see 0xffffe010, in which case you
need to pop those three registers and the PC off the stack to restore the
caller's frame.

So, these cases are like a leaf function with no debugging info.  The
first solution idea was interpreting the epilogue code.  It will
probably be safe to assume that it looks like epilogue code normally
does, i.e. register pops and not any arbitrary instructions.

Another solution I was considering is to have the system somewhere provide
DWARF unwind info matching the possible PC addresses in the vsyscall page.
I am now pretty sure this is the way to go.  The recent development is that
NPTL now needs .eh_frame information for these PCs as well, and Ulrich has
made a kernel change to provide it.  The .eh_frame info for the vsyscall
PCs is on the same read-only kernel page.  The C library now uses this as
if the vsyscall page were a DSO with .eh_frame info to register, so that
exception-style unwinding from any valid PC in a magic entry point works.

So, there is a .eh_frame section available for this code, and getting it
from where it is into gdb can be done by hook or by crook.  I have the
impression that gdb turning an available .eh_frame section into happy
backtraces is something that might be expected real soon now.  
Sounds like a winner.

I think that elucidates all but the dreariest bits of the technical issues.
Now the practical questions.  Oh, one dreary bit: 83172 mostly talks about
the fact that ptrace refuses to read the 0xffffe000 page for you, which is
presumed a prerequisite for dealing with the real can of worms (unwinding).

--------------------

I think right now the public 2.5 kernel has a fix to make the page
readable, and another one to provide the .eh_frame information. There
is no mechanism yet to make that debug info accessible to gdb.

elena

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Linux kernel problem -- food for thoughts
  2003-04-16 14:19 Linux kernel problem -- food for thoughts Elena Zannoni
@ 2003-04-16 14:28 ` Daniel Jacobowitz
  2003-04-16 14:38   ` Elena Zannoni
  2003-04-16 21:04   ` Linux kernel problem -- food for thoughts Roland McGrath
  2003-04-16 20:51 ` Roland McGrath
  1 sibling, 2 replies; 11+ messages in thread
From: Daniel Jacobowitz @ 2003-04-16 14:28 UTC (permalink / raw)
  To: Elena Zannoni; +Cc: gdb, roland

On Wed, Apr 16, 2003 at 10:24:03AM -0400, Elena Zannoni wrote:
> 
> Gdb is currently having a 'little problem' backtracing out of system
> calls in x86 kernels which support NPTL. I think the current public
> 2.5 kernel would make this problem show up.
> 
> Right now, if you are in system calls the backtrace will show up as:
> 
>  0xffffe002 in ??

I was just thinking about this.  My reaction is:
  - the page needs to be readable; I vaguely remember badgering Linus
about this and getting it fixed, but it might have been someone else,
or it might not have gotten fixed.
  - GDB needs to get the location of the EH information from glibc
somehow.  My instinct is to make glibc export this in a global symbol,
just like the way we get signal numbers from linuxthreads.

How does that sound?


Note that we don't use eh information on i386 yet.  We need to fix
that.  I tried once and got distracted by another project, I think :)

> 
> Here is an explanation of the problem that Roland has provided:
> 
> ---------------
> Previously asm or C code in libc entered the kernel by setting some
> registers and using the "int $0x80" instruction.  e.g.
> 
> 00000000 <__getpid>:
>    0:	b8 14 00 00 00       	mov    $0x14,%eax
>    5:	cd 80                	int    $0x80
>    7:	c3                   	ret    
> 
> That is the function called __getpid in libc, the pre-NPTL build.  (In the
> shared library you will see this if you've run with LD_ASSUME_KERNEL=2.4.1
> so that /lib/i686/libc.so.6 is what you are using.)
> 
> In the new libc (/lib/tls/libc.so.6), that function looks like this:
> 
> 00000000 <__getpid>:
>    0:	b8 14 00 00 00       	mov    $0x14,%eax
>    5:	65 ff 15 10 00 00 00 	call   *%gs:0x10
>    c:	c3                   	ret    
> 
> %gs:0x10 is a location that has been initialized to a kernel-supplied
> special entry point address.  In the current kernels, that address is
> always 0xffffe000.  But that is not part of the ABI, which is why it's
> indirect instead of a literal "call 0xffffe000".  The kernel supplies the
> actual entry point address to libc at startup time, and nothing in the
> kernel-user interface prevents it from using a different address in each
> process if it chose to.
> 
> The reason for this is that there can be multiple ways to enter the kernel,
> not just the "int $0x80" trap instruction.  Some kernels on some hardware
> may use a different method that performs better.  By using this
> kernel-supplied entry point address, no user code has to be changed to
> select the method.  It's entirely the kernel's choice.
> 
> In all the RH kernels we have right now, the entry point page contains:
> 
> 	0xffffe000:	int $0x80
> 	0xffffe002:	ret
> 
> But user code cannot presume what this code sequence looks like exactly.
> It will be some sequence of register and stack moves and special trap
> instructions, but you have to disassemble to know exactly.  In the case
> above, the PC value seen while a thread is in the kernel is 0xffffe002.
> You can disassemble the "ret" there and see that you have to pop the PC off
> the stack to recover the caller's frame.  
> 
> Another example of what this code might look like when you disassemble it is:
> 
> 	0xffffe000:	push   %ecx
> 	0xffffe001:	push   %edx
> 	0xffffe002:	push   %ebp
> 	0xffffe003: 	mov    %esp,%ebp
> 	0xffffe005: 	sysenter 
> 	0xffffe007:	nop    
> 	0xffffe008:	nop    
> 	0xffffe009:	nop    
> 	0xffffe00a:	nop    
> 	0xffffe00b:	nop    
> 	0xffffe00c:	nop    
> 	0xffffe00d:	nop    
> 	0xffffe00e: 	jmp    0xffffe003
> 	0xffffe010:	pop    %ebp
> 	0xffffe011:	pop    %edx
> 	0xffffe012:	pop    %ecx
> 	0xffffe013:	ret    
> 
> In this example, depending on what happened inside the kernel the PC you
> usually see may be either 0xffffe00e or 0xffffe010.  If the process gets a
> signal or you attach asynchronously or so forth, the PC might be at any of
> the earlier instructions as well.  You cannot rely on exactly what the
> sequence is, so you must be able to disassemble from where you are and
> cope.  In this case you will most often see 0xffffe010, in which case you
> need to pop those three registers and the PC off the stack to restore the
> caller's frame.
> 
> So, these cases are like a leaf function with no debugging info.  The
> first solution idea was interpreting the epilogue code.  It will
> probably be safe to assume that it looks like epilogue code normally
> does, i.e. register pops and not any arbitrary instructions.
> 
> Another solution I was considering is to have the system somewhere provide
> DWARF unwind info matching the possible PC addresses in the vsyscall page.
> I am now pretty sure this is the way to go.  The recent development is that
> NPTL now needs .eh_frame information for these PCs as well, and Ulrich has
> made a kernel change to provide it.  The .eh_frame info for the vsyscall
> PCs is on the same read-only kernel page.  The C library now uses this as
> if the vsyscall page were a DSO with .eh_frame info to register, so that
> exception-style unwinding from any valid PC in a magic entry point works.
> 
> So, there is a .eh_frame section available for this code, and getting it
> from where it is into gdb can be done by hook or by crook.  I have the
> impression that gdb turning an available .eh_frame section into happy
> backtraces is something that might be expected real soon now.  
> Sounds like a winner.
> 
> I think that elucidates all but the dreariest bits of the technical issues.
> Now the practical questions.  Oh, one dreary bit: 83172 mostly talks about
> the fact that ptrace refuses to read the 0xffffe000 page for you, which is
> presumed a prerequisite for dealing with the real can of worms (unwinding).
> 
> --------------------
> 
> 
> I think right now the public 2.5 kernel has a fix to make the page
> readable, and another one to provide the .eh_frame information. There
> is no mechanism yet to make that debug info accessible to gdb.
> 
> 
> elena
> 

-- 
Daniel Jacobowitz
MontaVista Software                         Debian GNU/Linux Developer

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Linux kernel problem -- food for thoughts
  2003-04-16 14:28 ` Daniel Jacobowitz
@ 2003-04-16 14:38   ` Elena Zannoni
  2003-04-16 14:42     ` Daniel Jacobowitz
  2003-04-16 21:04   ` Linux kernel problem -- food for thoughts Roland McGrath
  1 sibling, 1 reply; 11+ messages in thread
From: Elena Zannoni @ 2003-04-16 14:38 UTC (permalink / raw)
  To: Daniel Jacobowitz; +Cc: Elena Zannoni, gdb, roland

Daniel Jacobowitz writes:
 > On Wed, Apr 16, 2003 at 10:24:03AM -0400, Elena Zannoni wrote:
 > > 
 > > Gdb is currently having a 'little problem' backtracing out of system
 > > calls in x86 kernels which support NPTL. I think the current public
 > > 2.5 kernel would make this problem show up.
 > > 
 > > Right now, if you are in system calls the backtrace will show up as:
 > > 
 > >  0xffffe002 in ??
 > 
 > I was just thinking about this.  My reaction is:
 >   - the page needs to be readable; I vaguely remember badgering Linus
 > about this and getting it fixed, but it might have been someone else,
 > or it might not have gotten fixed.
 >   - GDB needs to get the location of the EH information from glibc
 > somehow.  My instinct is to make glibc export this in a global symbol,
 > just like the way we get signal numbers from linuxthreads.
 > 
 > How does that sound?

Roland (but I'll let him speak) has had a thought about creating a
/proc/pid/vsyscall file, which then gdb could read with add-symbol-file....

the page is readable right now in 2.5 and the patch for the .eh_frame
has been integrated.

core files will also need to be addressed.

 > 
 > 
 > Note that we don't use eh information on i386 yet.  We need to fix
 > that.  I tried once and got distracted by another project, I think :)

Yep, of course.

elena


 > 
 > > 
 > > Here is an explanation of the problem that Roland has provided:
 > > 
 > > ---------------
 > > Previously asm or C code in libc entered the kernel by setting some
 > > registers and using the "int $0x80" instruction.  e.g.
 > > 
 > > 00000000 <__getpid>:
 > >    0:	b8 14 00 00 00       	mov    $0x14,%eax
 > >    5:	cd 80                	int    $0x80
 > >    7:	c3                   	ret    
 > > 
 > > That is the function called __getpid in libc, the pre-NPTL build.  (In the
 > > shared library you will see this if you've run with LD_ASSUME_KERNEL=2.4.1
 > > so that /lib/i686/libc.so.6 is what you are using.)
 > > 
 > > In the new libc (/lib/tls/libc.so.6), that function looks like this:
 > > 
 > > 00000000 <__getpid>:
 > >    0:	b8 14 00 00 00       	mov    $0x14,%eax
 > >    5:	65 ff 15 10 00 00 00 	call   *%gs:0x10
 > >    c:	c3                   	ret    
 > > 
 > > %gs:0x10 is a location that has been initialized to a kernel-supplied
 > > special entry point address.  In the current kernels, that address is
 > > always 0xffffe000.  But that is not part of the ABI, which is why it's
 > > indirect instead of a literal "call 0xffffe000".  The kernel supplies the
 > > actual entry point address to libc at startup time, and nothing in the
 > > kernel-user interface prevents it from using a different address in each
 > > process if it chose to.
 > > 
 > > The reason for this is that there can be multiple ways to enter the kernel,
 > > not just the "int $0x80" trap instruction.  Some kernels on some hardware
 > > may use a different method that performs better.  By using this
 > > kernel-supplied entry point address, no user code has to be changed to
 > > select the method.  It's entirely the kernel's choice.
 > > 
 > > In all the RH kernels we have right now, the entry point page contains:
 > > 
 > > 	0xffffe000:	int $0x80
 > > 	0xffffe002:	ret
 > > 
 > > But user code cannot presume what this code sequence looks like exactly.
 > > It will be some sequence of register and stack moves and special trap
 > > instructions, but you have to disassemble to know exactly.  In the case
 > > above, the PC value seen while a thread is in the kernel is 0xffffe002.
 > > You can disassemble the "ret" there and see that you have to pop the PC off
 > > the stack to recover the caller's frame.  
 > > 
 > > Another example of what this code might look like when you disassemble it is:
 > > 
 > > 	0xffffe000:	push   %ecx
 > > 	0xffffe001:	push   %edx
 > > 	0xffffe002:	push   %ebp
 > > 	0xffffe003: 	mov    %esp,%ebp
 > > 	0xffffe005: 	sysenter 
 > > 	0xffffe007:	nop    
 > > 	0xffffe008:	nop    
 > > 	0xffffe009:	nop    
 > > 	0xffffe00a:	nop    
 > > 	0xffffe00b:	nop    
 > > 	0xffffe00c:	nop    
 > > 	0xffffe00d:	nop    
 > > 	0xffffe00e: 	jmp    0xffffe003
 > > 	0xffffe010:	pop    %ebp
 > > 	0xffffe011:	pop    %edx
 > > 	0xffffe012:	pop    %ecx
 > > 	0xffffe013:	ret    
 > > 
 > > In this example, depending on what happened inside the kernel the PC you
 > > usually see may be either 0xffffe00e or 0xffffe010.  If the process gets a
 > > signal or you attach asynchronously or so forth, the PC might be at any of
 > > the earlier instructions as well.  You cannot rely on exactly what the
 > > sequence is, so you must be able to disassemble from where you are and
 > > cope.  In this case you will most often see 0xffffe010, in which case you
 > > need to pop those three registers and the PC off the stack to restore the
 > > caller's frame.
 > > 
 > > So, these cases are like a leaf function with no debugging info.  The
 > > first solution idea was interpreting the epilogue code.  It will
 > > probably be safe to assume that it looks like epilogue code normally
 > > does, i.e. register pops and not any arbitrary instructions.
 > > 
 > > Another solution I was considering is to have the system somewhere provide
 > > DWARF unwind info matching the possible PC addresses in the vsyscall page.
 > > I am now pretty sure this is the way to go.  The recent development is that
 > > NPTL now needs .eh_frame information for these PCs as well, and Ulrich has
 > > made a kernel change to provide it.  The .eh_frame info for the vsyscall
 > > PCs is on the same read-only kernel page.  The C library now uses this as
 > > if the vsyscall page were a DSO with .eh_frame info to register, so that
 > > exception-style unwinding from any valid PC in a magic entry point works.
 > > 
 > > So, there is a .eh_frame section available for this code, and getting it
 > > from where it is into gdb can be done by hook or by crook.  I have the
 > > impression that gdb turning an available .eh_frame section into happy
 > > backtraces is something that might be expected real soon now.  
 > > Sounds like a winner.
 > > 
 > > I think that elucidates all but the dreariest bits of the technical issues.
 > > Now the practical questions.  Oh, one dreary bit: 83172 mostly talks about
 > > the fact that ptrace refuses to read the 0xffffe000 page for you, which is
 > > presumed a prerequisite for dealing with the real can of worms (unwinding).
 > > 
 > > --------------------
 > > 
 > > 
 > > I think right now the public 2.5 kernel has a fix to make the page
 > > readable, and another one to provide the .eh_frame information. There
 > > is no mechanism yet to make that debug info accessible to gdb.
 > > 
 > > 
 > > elena
 > > 
 > 
 > -- 
 > Daniel Jacobowitz
 > MontaVista Software                         Debian GNU/Linux Developer

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Linux kernel problem -- food for thoughts
  2003-04-16 14:38   ` Elena Zannoni
@ 2003-04-16 14:42     ` Daniel Jacobowitz
  2003-05-22 22:24       ` disassembly in gdb? Kumar Gala
  0 siblings, 1 reply; 11+ messages in thread
From: Daniel Jacobowitz @ 2003-04-16 14:42 UTC (permalink / raw)
  To: Elena Zannoni; +Cc: gdb, roland

On Wed, Apr 16, 2003 at 10:43:12AM -0400, Elena Zannoni wrote:
> Daniel Jacobowitz writes:
>  > On Wed, Apr 16, 2003 at 10:24:03AM -0400, Elena Zannoni wrote:
>  > > 
>  > > Gdb is currently having a 'little problem' backtracing out of system
>  > > calls in x86 kernels which support NPTL. I think the current public
>  > > 2.5 kernel would make this problem show up.
>  > > 
>  > > Right now, if you are in system calls the backtrace will show up as:
>  > > 
>  > >  0xffffe002 in ??
>  > 
>  > I was just thinking about this.  My reaction is:
>  >   - the page needs to be readable; I vaguely remember badgering Linus
>  > about this and getting it fixed, but it might have been someone else,
>  > or it might not have gotten fixed.
>  >   - GDB needs to get the location of the EH information from glibc
>  > somehow.  My instinct is to make glibc export this in a global symbol,
>  > just like the way we get signal numbers from linuxthreads.
>  > 
>  > How does that sound?
> 
> Roland (but I'll let him speak) has had a thought about creating a
> /proc/pid/vsyscall file, which then gdb could read with add-symbol-file....
> 
> the page is readable right now in 2.5 and the patch for the .eh_frame
> has been integrated.
> 
> core files will also need to be addressed.

Oww.  Should we just include the page in core dumps?  That might be
the simplest solution.

Roland, I think that doing it in glibc is a better idea than doing it
in /proc somewhere; it will make remote debugging and core debugging
more straightforward.

>  > Note that we don't use eh information on i386 yet.  We need to fix
>  > that.  I tried once and got distracted by another project, I think :)
> 
> Yep, of course.
> 
> elena
> 
> 
>  > 
>  > > 
>  > > Here is an explanation of the problem that Roland has provided:
>  > > 
>  > > ---------------
>  > > Previously asm or C code in libc entered the kernel by setting some
>  > > registers and using the "int $0x80" instruction.  e.g.
>  > > 
>  > > 00000000 <__getpid>:
>  > >    0:	b8 14 00 00 00       	mov    $0x14,%eax
>  > >    5:	cd 80                	int    $0x80
>  > >    7:	c3                   	ret    
>  > > 
>  > > That is the function called __getpid in libc, the pre-NPTL build.  (In the
>  > > shared library you will see this if you've run with LD_ASSUME_KERNEL=2.4.1
>  > > so that /lib/i686/libc.so.6 is what you are using.)
>  > > 
>  > > In the new libc (/lib/tls/libc.so.6), that function looks like this:
>  > > 
>  > > 00000000 <__getpid>:
>  > >    0:	b8 14 00 00 00       	mov    $0x14,%eax
>  > >    5:	65 ff 15 10 00 00 00 	call   *%gs:0x10
>  > >    c:	c3                   	ret    
>  > > 
>  > > %gs:0x10 is a location that has been initialized to a kernel-supplied
>  > > special entry point address.  In the current kernels, that address is
>  > > always 0xffffe000.  But that is not part of the ABI, which is why it's
>  > > indirect instead of a literal "call 0xffffe000".  The kernel supplies the
>  > > actual entry point address to libc at startup time, and nothing in the
>  > > kernel-user interface prevents it from using a different address in each
>  > > process if it chose to.
>  > > 
>  > > The reason for this is that there can be multiple ways to enter the kernel,
>  > > not just the "int $0x80" trap instruction.  Some kernels on some hardware
>  > > may use a different method that performs better.  By using this
>  > > kernel-supplied entry point address, no user code has to be changed to
>  > > select the method.  It's entirely the kernel's choice.
>  > > 
>  > > In all the RH kernels we have right now, the entry point page contains:
>  > > 
>  > > 	0xffffe000:	int $0x80
>  > > 	0xffffe002:	ret
>  > > 
>  > > But user code cannot presume what this code sequence looks like exactly.
>  > > It will be some sequence of register and stack moves and special trap
>  > > instructions, but you have to disassemble to know exactly.  In the case
>  > > above, the PC value seen while a thread is in the kernel is 0xffffe002.
>  > > You can disassemble the "ret" there and see that you have to pop the PC off
>  > > the stack to recover the caller's frame.  
>  > > 
>  > > Another example of what this code might look like when you disassemble it is:
>  > > 
>  > > 	0xffffe000:	push   %ecx
>  > > 	0xffffe001:	push   %edx
>  > > 	0xffffe002:	push   %ebp
>  > > 	0xffffe003: 	mov    %esp,%ebp
>  > > 	0xffffe005: 	sysenter 
>  > > 	0xffffe007:	nop    
>  > > 	0xffffe008:	nop    
>  > > 	0xffffe009:	nop    
>  > > 	0xffffe00a:	nop    
>  > > 	0xffffe00b:	nop    
>  > > 	0xffffe00c:	nop    
>  > > 	0xffffe00d:	nop    
>  > > 	0xffffe00e: 	jmp    0xffffe003
>  > > 	0xffffe010:	pop    %ebp
>  > > 	0xffffe011:	pop    %edx
>  > > 	0xffffe012:	pop    %ecx
>  > > 	0xffffe013:	ret    
>  > > 
>  > > In this example, depending on what happened inside the kernel the PC you
>  > > usually see may be either 0xffffe00e or 0xffffe010.  If the process gets a
>  > > signal or you attach asynchronously or so forth, the PC might be at any of
>  > > the earlier instructions as well.  You cannot rely on exactly what the
>  > > sequence is, so you must be able to disassemble from where you are and
>  > > cope.  In this case you will most often see 0xffffe010, in which case you
>  > > need to pop those three registers and the PC off the stack to restore the
>  > > caller's frame.
>  > > 
>  > > So, these cases are like a leaf function with no debugging info.  The
>  > > first solution idea was interpreting the epilogue code.  It will
>  > > probably be safe to assume that it looks like epilogue code normally
>  > > does, i.e. register pops and not any arbitrary instructions.
>  > > 
>  > > Another solution I was considering is to have the system somewhere provide
>  > > DWARF unwind info matching the possible PC addresses in the vsyscall page.
>  > > I am now pretty sure this is the way to go.  The recent development is that
>  > > NPTL now needs .eh_frame information for these PCs as well, and Ulrich has
>  > > made a kernel change to provide it.  The .eh_frame info for the vsyscall
>  > > PCs is on the same read-only kernel page.  The C library now uses this as
>  > > if the vsyscall page were a DSO with .eh_frame info to register, so that
>  > > exception-style unwinding from any valid PC in a magic entry point works.
>  > > 
>  > > So, there is a .eh_frame section available for this code, and getting it
>  > > from where it is into gdb can be done by hook or by crook.  I have the
>  > > impression that gdb turning an available .eh_frame section into happy
>  > > backtraces is something that might be expected real soon now.  
>  > > Sounds like a winner.
>  > > 
>  > > I think that elucidates all but the dreariest bits of the technical issues.
>  > > Now the practical questions.  Oh, one dreary bit: 83172 mostly talks about
>  > > the fact that ptrace refuses to read the 0xffffe000 page for you, which is
>  > > presumed a prerequisite for dealing with the real can of worms (unwinding).
>  > > 
>  > > --------------------
>  > > 
>  > > 
>  > > I think right now the public 2.5 kernel has a fix to make the page
>  > > readable, and another one to provide the .eh_frame information. There
>  > > is no mechanism yet to make that debug info accessible to gdb.
>  > > 
>  > > 
>  > > elena
>  > > 
>  > 
>  > -- 
>  > Daniel Jacobowitz
>  > MontaVista Software                         Debian GNU/Linux Developer
> 

-- 
Daniel Jacobowitz
MontaVista Software                         Debian GNU/Linux Developer

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Linux kernel problem -- food for thoughts
  2003-04-16 14:19 Linux kernel problem -- food for thoughts Elena Zannoni
  2003-04-16 14:28 ` Daniel Jacobowitz
@ 2003-04-16 20:51 ` Roland McGrath
  1 sibling, 0 replies; 11+ messages in thread
From: Roland McGrath @ 2003-04-16 20:51 UTC (permalink / raw)
  To: Elena Zannoni; +Cc: gdb, drow

> I think right now the public 2.5 kernel has a fix to make the page
> readable, and another one to provide the .eh_frame information. There
> is no mechanism yet to make that debug info accessible to gdb.

Actually, the page is not yet readable in any 2.5 kernel.  I've sent a
somewhat hacky patch to Linus and am waiting for feedback.  Hopefully that
or a better fix will go into 2.5 pretty soon.  It is correct that the
current 2.5 (after 2.5.67) has the unwind info in the page.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Linux kernel problem -- food for thoughts
  2003-04-16 14:28 ` Daniel Jacobowitz
  2003-04-16 14:38   ` Elena Zannoni
@ 2003-04-16 21:04   ` Roland McGrath
  1 sibling, 0 replies; 11+ messages in thread
From: Roland McGrath @ 2003-04-16 21:04 UTC (permalink / raw)
  To: Daniel Jacobowitz; +Cc: Elena Zannoni, gdb

> I was just thinking about this.  My reaction is:
>   - the page needs to be readable; I vaguely remember badgering Linus
> about this and getting it fixed, but it might have been someone else,
> or it might not have gotten fixed.

Or maybe you just got (or replicated) the half-assed patch I did for this,
which only went to Linus yesterday.

>   - GDB needs to get the location of the EH information from glibc
> somehow.  My instinct is to make glibc export this in a global symbol,
> just like the way we get signal numbers from linuxthreads.

Bletch.  libc isn't the source of the information.  You could have a
program that doesn't use glibc and since winds up at these PC locations.
gdb should get the information from the kernel directly somehow.

[Elena:]
> Roland (but I'll let him speak) has had a thought about creating a
> /proc/pid/vsyscall file, which then gdb could read with add-symbol-file....

I implemented this as a quick hack.  I can send the 2.5 kernel patch to
anyone who is interested.  I am pretty sure this wouldn't get integrated
into the kernel if I lobbied for it.  Since the same address is used in
every process (at least in all the kernels around now), it's also easy to
use a little program (which I wrote first) to generate the file from the
running kernel and then you can use that plain file with add-symbol-file.

This is what I recommend we use for the first development step.  That is,
when the DWARF unwinding support for x86 is enabled generically, we can try
the add-symbol-file trick (or an internal call kludged in) for the purposes
of testing and debugging the unwinding, seeing the testsuite run happily
with good backtraces, etc.  It seems unlikely to be a good real implementation.

For core files, I do think the only sensible thing will be to make the
kernel write the vsyscall page into every core dump.  Otherwise you can't
always be sure when looking at a post-mortem exactly what the PC values it
was using while alive meant at the time.  

For a live implementation not relying on a virtual ELF file, and for core
files, there remains the issue of finding the .eh_frame start address.  I
have some thoughts, and that's a relatively minor detail to be worked out
once the whole plan of using that info is agreed upon.

Thanks,
Roland

^ permalink raw reply	[flat|nested] 11+ messages in thread

* disassembly in gdb?
  2003-04-16 14:42     ` Daniel Jacobowitz
@ 2003-05-22 22:24       ` Kumar Gala
  2003-05-22 22:53         ` Kevin Buettner
  0 siblings, 1 reply; 11+ messages in thread
From: Kumar Gala @ 2003-05-22 22:24 UTC (permalink / raw)
  To: gdb

How does gdb know which disassembler options to use for a given 
architecture target?  For example on PPC there are various flags to 
enable such things as SPE, AltiVec, ISEL, etc.

Is there anyway in an existing gdb to have the disassembler understand 
a different subset of opcodes for the architecture?

thanks.

- kumar

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: disassembly in gdb?
  2003-05-22 22:24       ` disassembly in gdb? Kumar Gala
@ 2003-05-22 22:53         ` Kevin Buettner
  2003-05-22 23:15           ` Kumar Gala
  0 siblings, 1 reply; 11+ messages in thread
From: Kevin Buettner @ 2003-05-22 22:53 UTC (permalink / raw)
  To: Kumar Gala, gdb

On May 22,  5:24pm, Kumar Gala wrote:

> How does gdb know which disassembler options to use for a given 
> architecture target?  For example on PPC there are various flags to 
> enable such things as SPE, AltiVec, ISEL, etc.
> 
> Is there anyway in an existing gdb to have the disassembler understand 
> a different subset of opcodes for the architecture?

This is supposed to be handled by the disassembler_options field in
the disassemble_info struct.  Unfortunately, I don't see anything
in the public GDB sources which set these options for any of the
PPC cores.  (mips and i386 have some code which do this though.)

Kevin

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: disassembly in gdb?
  2003-05-22 22:53         ` Kevin Buettner
@ 2003-05-22 23:15           ` Kumar Gala
  2003-05-22 23:52             ` Kevin Buettner
  0 siblings, 1 reply; 11+ messages in thread
From: Kumar Gala @ 2003-05-22 23:15 UTC (permalink / raw)
  To: Kevin Buettner; +Cc: gdb

Can you be more specific as to where in i386 or mips that 
disassemble_info is set for those targets?

- kumar

On Thursday, May 22, 2003, at 05:51 PM, Kevin Buettner wrote:

> On May 22,  5:24pm, Kumar Gala wrote:
>
>> How does gdb know which disassembler options to use for a given
>> architecture target?  For example on PPC there are various flags to
>> enable such things as SPE, AltiVec, ISEL, etc.
>>
>> Is there anyway in an existing gdb to have the disassembler understand
>> a different subset of opcodes for the architecture?
>
> This is supposed to be handled by the disassembler_options field in
> the disassemble_info struct.  Unfortunately, I don't see anything
> in the public GDB sources which set these options for any of the
> PPC cores.  (mips and i386 have some code which do this though.)
>
> Kevin

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: disassembly in gdb?
  2003-05-22 23:15           ` Kumar Gala
@ 2003-05-22 23:52             ` Kevin Buettner
  2003-05-23 15:15               ` Andrew Cagney
  0 siblings, 1 reply; 11+ messages in thread
From: Kevin Buettner @ 2003-05-22 23:52 UTC (permalink / raw)
  To: Kumar Gala, Kevin Buettner; +Cc: gdb

On May 22,  6:15pm, Kumar Gala wrote:

> Can you be more specific as to where in i386 or mips that 
> disassemble_info is set for those targets?

For i386, look at i386_print_insn() in i386-tdep.c.
For mips, look at mips_gdbarch_init() in mips-tdep.c.

Kevin

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: disassembly in gdb?
  2003-05-22 23:52             ` Kevin Buettner
@ 2003-05-23 15:15               ` Andrew Cagney
  0 siblings, 0 replies; 11+ messages in thread
From: Andrew Cagney @ 2003-05-23 15:15 UTC (permalink / raw)
  To: Kevin Buettner; +Cc: Kumar Gala, gdb

See also:

http://sources.redhat.com/gdb/bugs/1177
http://sources.redhat.com/gdb/bugs/548

The current prefered command is "set <cpu> disassembler".

Andrew

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2003-05-23 15:15 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-04-16 14:19 Linux kernel problem -- food for thoughts Elena Zannoni
2003-04-16 14:28 ` Daniel Jacobowitz
2003-04-16 14:38   ` Elena Zannoni
2003-04-16 14:42     ` Daniel Jacobowitz
2003-05-22 22:24       ` disassembly in gdb? Kumar Gala
2003-05-22 22:53         ` Kevin Buettner
2003-05-22 23:15           ` Kumar Gala
2003-05-22 23:52             ` Kevin Buettner
2003-05-23 15:15               ` Andrew Cagney
2003-04-16 21:04   ` Linux kernel problem -- food for thoughts Roland McGrath
2003-04-16 20:51 ` Roland McGrath

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).