JakeHillion/drgn

mirror of https://github.com/JakeHillion/drgn.git synced 2024-12-22 17:23:06 +00:00

Author	SHA1	Message	Date
Michel Alexandre Salim	c0ed1a3203	Fix spelling error abbrevation => abbreviation; caught by Debian's lintian Signed-off-by: Michel Alexandre Salim <michel@michel-slm.name>	2022-08-17 21:45:51 -07:00
Omar Sandoval	6c90315f6f	python: fix FaultError reference leak PyErr_SetObject() takes a reference on the exception value, so we need to drop the reference we got when we created the value. Issue #196 ran into this by reading tons of unmapped addresses. Fixes: `80fef04c70` ("Add address attribute to FaultError exception") Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-08-16 17:35:36 -07:00
Omar Sandoval	a19203a73e	libdrgn: fix QEMU guest memory dump Kconfig suggestion The config option is and always has been CONFIG_FW_CFG_SYSFS, not CONFIG_FW_CFG. Also suggest the user-visible CONFIG_KEXEC instead of the internal CONFIG_CRASH_CORE. Fixes: `2bd861f719` ("libdrgn: program: detect QEMU guest memory dumps without VMCOREINFO") Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-08-15 15:11:56 -07:00
Omar Sandoval	faaf01ad1b	Add drgn.StackTrace.prog and drgn_stack_trace_program() If we only have the stack trace available, it's useful to get the program it came from. This'll be used eventually for helpers that take a stack trace. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-08-11 14:45:54 -07:00
Omar Sandoval	e3ba4d2f99	drgn 0.0.20 Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-25 16:52:28 -07:00
Omar Sandoval	e9d16732d6	libdrgn: x86_64: fix page table iteration over non-canonical range We're currently checking whether the iterator has entered the non-canonical range when fetching the last level of the page table, but the cutover actually happens while we're in the last level. Fix it by doing the check unconditionally. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-24 00:03:45 -07:00
Jay Kamat	063850325f	libdrgn: dwarf: look up complete types in namespaces drgn_debug_info_find_complete() looks up the name of the incomplete type in the global namespace. This is incorrect for C++: we need to look it up in the namespace that the DIE is in. To find the containing namespace, we need to do a DIE ancestor walk. We don't want to do this for C, so add a flag indicating whether a language has namespaces to struct drgn_language. If it's true, then we do the ancestor walk and then look up the name in the appropriate namespace. Signed-off-by: Jay Kamat <jaygkamat@gmail.com>	2022-07-15 16:02:56 -07:00
Omar Sandoval	db3babd42e	libdrgn: aarch64: implement page table iterator Now that we made the other memory management helpers generic, the last thing to implement for AArch64 is page table walking. This looks a lot like the x86-64 equivalent but has to support the various page and virtual address sizes that can be configured for AArch64. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-14 12:23:08 -07:00
Omar Sandoval	b28bd9f0a3	libdrgn: linux_kernel: get vmemmap generically AArch64 has changed the location of vmemmap multiple times, and not all of these can be easily distinguished. Rather than restorting to kernel version checks, this replaces the vmemmap architecture callback with a generic approach that gets the vmemmap address directly from the mem_section table. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-14 12:05:11 -07:00
Omar Sandoval	a213573b23	libdrgn: linux_kernel: make virt_to_phys() and phys_to_virt() generic On x86-64, the difference between virtual addresses in the direct map and the corresponding physical addresses is called PAGE_OFFSET, so we exposed that via an architecture callback and the Linux kernel object finder. However, this doesn't translate to other architectures. Namely, on AArch64, the difference is PAGE_OFFSET - PHYS_OFFSET, and both PAGE_OFFSET and PHYS_OFFSET have varied over time and between configurations. We can remove the architecture callback and avoid version-specific logic by letting the page table tell us the offset. We just need an address in the direct map, which is easy to find since this includes kmalloc and memblock allocations. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-14 12:05:11 -07:00
Omar Sandoval	5fe38c7371	libdrgn: linux_kernel: fix read_vm() coalescing comparison linux_helper_read_vm() has logic to merge adjacent physical address ranges returned by the page table iterator. However, the check for whether the ranges are adjacent is incorrect. Fix it. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-14 12:05:11 -07:00
Omar Sandoval	571949a743	libdrgn: x86_64: don't bother zeroing cached page table on initialization pgtable_iterator_x86_64::table is only used if pgtable_iterator_x86_64::index indicates that it has any cached entries, so there's no point initializing table since we initialize index to indicate that nothing is cached. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-14 12:05:11 -07:00
Omar Sandoval	36fecd1ded	libdrgn: refactor page table iterators AArch64 will need different sizes of page table iterators depending on the page size and virtual address size. Rather than the static pgtable_iterator_arch_size, allow architectures to define callbacks for allocating and freeing a page table iterator. Also remove the generic page table iterator wrapper and just pass that information to the iterator function. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-14 12:05:11 -07:00
Omar Sandoval	b3a6d6a35f	libdrgn: linux_kernel: cache PAGE_SHIFT derived from PAGE_SIZE Rather than computing it every time we need it, compute it once when we parse PAGE_SIZE from VMCOREINFO (and validate that PAGE_SIZE is a power of two). This will be more important for AArch64 page table walking. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-07-14 12:05:09 -07:00
Kevin Svetlitski	5aaf3db6fc	libdrgn: support reference and absent objects with float types which aren't 32 or 64 bits Very similar to `a541e9b170`, but adds partial support for floats (as opposed to integers) which aren't 32 or 64 bits. Signed-off-by: Kevin Svetlitski <svetlitski@fb.com>	2022-07-06 15:47:18 -07:00
Kevin Svetlitski	661d6a186c	Add support for UTF character base types Previously `drgn` did not recognize the `DW_ATE_UTF` encoding for base types, and consequently could not handle `char8_t`, `char16_t`, or `char32_t`. This has been remedied, and a corresponding test case added to prevent regressions. Signed-off-by: Kevin Svetlitski <svetlitski@fb.com>	2022-07-06 09:44:16 -07:00
Omar Sandoval	2bd861f719	libdrgn: program: detect QEMU guest memory dumps without VMCOREINFO Issue #182 reported that a core dump created by QEMU's dump-guest-memory command confuses drgn: by default, it only has NT_PRSTATUS notes and QEMU state notes for each CPU, so drgn thinks it's a userspace core dump, and it doesn't have the necessary VMCOREINFO to use it as a Linux kernel core dump. It turns out that QEMU and Linux can cooperate to add a VMCOREINFO note to the guest memory dump, which suffices for drgn. Let's detect a QEMU guest memory dump without a VMCOREINFO note and include instructions on how to capture a QEMU dump that makes drgn happy. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-28 00:41:05 -07:00
Omar Sandoval	63c0684b68	libdrgn: aarch64: mask away pointer authentication code in return addresses Now that we track RA_SIGN_STATE and get the pointer authentication code mask, we can remove the pointer authentication code from the return address while unwinding. Add a new architecture callback, ->demangle_return_address(), for this purpose. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-26 09:18:07 -07:00
Omar Sandoval	61befc1606	libdrgn: parse AArch64 PAC mask from core dumps In order to support removing the pointer authentication code (PAC) from return addresses on AArch64, we need to know what bits are being used for the PAC. We can get this from the NT_ARM_PAC_MASK note in userspace core dumps and from the NUMBER(KERNELPACMASK) field in VMCOREINFO for Linux kernel core dumps. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-26 09:18:07 -07:00
Omar Sandoval	3cba315293	libdrgn: linux_kernel: use memswitch for drgn_program_parse_vmcoreinfo() We currently have 5 names that we match against, and there are more on the way, so we might as well use a memswitch. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-26 09:18:07 -07:00
Omar Sandoval	9da9f6a871	libdrgn: fold struct vmcoreinfo into struct drgn_program In an upcoming commit, we will parse the AArch64 pointer authentication code mask either from the VMCOREINFO note or the NT_ARM_PAC_MASK note. Since it doesn't always come from VMCOREINFO, it doesn't make sense to put it in struct vmcoreinfo; struct drgn_program makes more sense. So, make parse_vmcoreinfo() take struct drgn_program instead of struct vmcoreinfo, rename it to drgn_program_parse_vmcoreinfo(), and replace struct vmcoreinfo with an anonymous struct in struct drgn_program. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-26 09:18:07 -07:00
Omar Sandoval	4d1b608507	libdrgn: aarch64: add RA_SIGN_STATE pseudo-register and DW_CFA_AARCH64_negate_ra_state The RA_SIGN_STATE pseudo-register indicates whether the return address is signed with a pointer authentication code. Add it to the register definitions. It can be set through a normal CFI register rule or the vendor-specific DW_CFA_AARCH64_negate_ra_state rule. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-26 09:18:07 -07:00
Omar Sandoval	9c9a2136f1	libdrgn: cfi: add rule to set register to constant This will be used to implement DW_CFA_AARCH64_negate_ra_state. Also fix a typographical error in a nearby comment. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-26 09:18:07 -07:00
Omar Sandoval	6bc55036e2	libdrgn: aarch64: add stack unwinding support Add the basic register definitions and stack unwinding support functions. Pointer authentication support will be added in subsequent commits. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-26 09:18:07 -07:00
Omar Sandoval	ad119cc1a6	libdrgn: ppc64: fix fallback unwinding Reading the ABI specification, I realized that fallback_unwind_ppc64() is completely wrong. Fix it. Fixes: `eec67768aa` ("libdrgn: replace elfutils DWARF unwinder with our own") Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-25 22:39:30 -07:00
Omar Sandoval	deabe2cb56	libdrgn: register_state: add and use drgn_register_state_get_u64() This factors out some boilerplate for getting registers as a uint64_t. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-25 22:39:30 -07:00
Omar Sandoval	0a7849d791	libdrgn: rename drgn_register_state_set_from_integer() -> from_u64() This is for consistency with drgn_register_state_get_u64() that we're about to add. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-25 22:39:30 -07:00
Omar Sandoval	cbdf6094b7	libdrgn: ppc64: fix DWARF link register confusion The usage of the link register in DWARF is a little confusing. On entry to a function, the link register contains the address that should be returned to. However, for DWARF, the link register is usually used as the CFI return_address_register, which means that in an unwound frame, it will contain the same thing as the program counter. I initially thought that this was a mistake, believing that the link register should contain the _next_ return address. However, after a return (with the blr instruction), the link register will indeed contain the same address as the program counter. This is consistent with our documentation of register values for function call frames: "the register values are the values when control returns to this frame". So, rename our internal "ra" register to "lr", expose it to the API, and add a little more documentation to the ppc64 initial register code. Fixes: `221a218704` ("libdrgn: add powerpc stack trace support") Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-25 22:39:30 -07:00
Omar Sandoval	49ae42ccfd	libdrgn: x86-64: add a few more register definitions In additional to the general-purpose registers, struct pt_regs also provides the cs and ss segment registers and the rflags register. elf_gregset_t provides the other segment registers as well. We should expose all of those. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-25 22:39:30 -07:00
Omar Sandoval	33d14f7703	libdrgn: rework architecture definition files Currently, register definitions are split across two files: arch_foo.defs lists the names of registers, and arch_foo.c defines the layout used to store registers in memory. The main rationale for this was that the layout could be processed entirely by the C preprocessor, but the register names needed an AWK script that we wanted to keep minimal. But since commit `af6f5a887d` ("libdrgn: replace gen_arch.awk with gen_arch_inc_strswitch.py"), arch_foo.defs is processed by a Python script. Let's define both the register names and the register layout in a new file, arch_foo_defs.py, which is processed by gen_arch_inc_strswitch.py This has a few benefits: * It puts all of the register definitions for an architecture in one place. * It is easier to maintain than preprocessor magic. (It also makes it trivial to support registers that don't exist in DWARF, which would've been harder to do with our preprocessor code.) * It gets rid of our DSL in favor of Python (which also lets us reduce repetition for the ppc64 definitions). Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-25 22:39:26 -07:00
Omar Sandoval	5681b99c3a	libdrgn: remove unused struct drgn_register::dwarf_number This hasn't been used since commit `eec67768aa` ("libdrgn: replace elfutils DWARF unwinder with our own"). Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-24 09:17:56 -07:00
Omar Sandoval	42e37e72c1	libdrgn: stack_trace: fix byte order for drgn_stack_frame_register() drgn_stack_frame_register() gets the register value with copy_lsbytes() and then byte swaps it if the program's byte order is different from the host's. But, copy_lsbytes() already fixes the byte order, so this ends up with the original (wrong) byte order. We also don't need to zero out the integer that we copy into since copy_lsbytes() also does that. Fixes: `eec67768aa` ("libdrgn: replace elfutils DWARF unwinder with our own") Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-24 09:17:56 -07:00
Omar Sandoval	ebfcabd5a8	libdrgn: linux_kernel: match explicitly-reported kernel modules by build ID Currently, we identify explicitly-reported kernel modules by the module name that we get from the .modinfo or the .gnu.linkonce.this_module section. However, objcopy --only-keep-debug (used for some Linux distro's separate debug files) does not keep these sections. This means that passing a file processed by objcopy --only-keep-debug to, e.g., drgn -s, fails with "could not find kernel module name". Instead of using the module name as the identifier, let's use the module's GNU build ID. We can get it on a live system from /sys/module/<module>/notes/, and on a core dump from struct module::notes_attrs (which is the implementation of that sysfs directory). This was split out of my larger debug info discovery rework, which will make more use of the build ID. Closes #178. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-01 14:21:12 -07:00
Omar Sandoval	c9265ef6d6	libdrgn: move alloc_or_reuse() to util.h Preparation for using it elsewhere. Also make it take a size_t instead of uint64_t. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-06-01 05:11:58 -07:00
Omar Sandoval	3595c81a8c	libdrgn: binary_search_tree: move member and entry_to_key to DEFINE_BINARY_SEARCH_TREE_FUNCTIONS() DEFINE_BINARY_SEARCH_TREE_TYPE() doesn't need these. This is preparation for a potential new use of a BST. But, it's also a good cleanup on its own and allows us to move some code out of memory_reader.h and into memory_reader.c. (This is similar to commit `1339dc6a2f` ("libdrgn: hash_table: move entry_to_key to DEFINE_HASH_TABLE_FUNCTIONS()").) Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-05-24 15:26:39 -07:00
Omar Sandoval	931d9c999d	libdrgn: linux_kernel: get module address range directly Instead of getting the address range from the sections we find, get it directly from /proc/modules or from the `struct module`. (We already had partial code to get the address range, but I can't remember why I didn't use it.) The real motivation for this is the upcoming module rework: it'll allow us to report the module and its address range before iterating through its sections. But it also means that we don't need the heuristic to ignore special sections that shouldn't be considered part of the address range (e.g., .init, .data..percpu [the latter of which we should be ignoring but get away with not because it's excluded from sysfs]). Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-05-23 16:27:49 -07:00
Omar Sandoval	7bbe0bc1c6	libdrgn: add drgn_error_dwrite() Like drgn_error_fwrite(), but writes to a file descriptor instead of a stdio stream. This will be used for logging. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-05-23 11:09:10 -07:00
Omar Sandoval	09ee4daed3	libdrgn: fix drgn_error_fwrite() for DRGN_ERROR_FAULT drgn_error_fwrite() only calls string_builder_append_error() to get special formatting for DRGN_ERROR_OS, but DRGN_ERROR_FAULT also needs special formatting. Rather than needing to keep drgn_error_fwrite() and string_builder_append_error() in sync, define them both in terms of a common macro. Fixes: `80fef04c70` ("Add address attribute to FaultError exception") Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-05-23 11:08:54 -07:00
Omar Sandoval	bc582b8a58	drgn 0.0.19 Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-05-18 13:57:02 -07:00
Omar Sandoval	a3b72e33c8	Fix some more flake8 errors Several have snuck in since the last time I did this in commit `5541fad063` ("Fix some flake8 errors"). Prepare for adding flake8 to pre-commit by fixing them. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-05-17 15:23:42 -07:00
Omar Sandoval	a541e9b170	libdrgn: support reference and absent objects with >64-bit integer types GCC and Clang have 128-bit integer types on 64-bit targets: __int128 and unsigned __int128. Clang additionally has N-bit integers of up to 2<<24 bits with _ExtInt(N), which was standardized in C23 as _BitInt(N). Currently, we disallow creating objects with a >64-bit integer type. Jay Kamat reported that this would cause errors when examining some binaries. The reason we disallow this is that we don't have a way to represent or do operations on >64-bit values. We could make use of a bignum library like GMP to do this in the future. However, for now, we can loosen this restriction and at least allow reference and absent objects with big integer types. This requires enforcing two things: that we never create a value object with a >64-bit integer type, and that we never read the value of a reference object with a >64-bit integer type. Co-authored-by: Jay Kamat <jaygkamat@gmail.com> Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-28 13:38:38 -07:00
Omar Sandoval	084e636341	libdrgn: add DRGN_ERROR_NOT_IMPLEMENTED This will be used for partial 128-bit object support. There are other places that should probably be converted to use it. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-28 13:38:38 -07:00
Omar Sandoval	ad7e64d9d8	libdrgn: make memdup() take a const void * Just like strdup(), memdup() doesn't modify the thing it's copying, so mark it const. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-27 00:37:46 -07:00
Omar Sandoval	14642fb3b6	libdrgn: add stub RISC-V architecture with relocation implementation The 32-bit and 64-bit variants have different register sizes, so they're different architectures in drgn. For now, put them in the same file so that they can share the relocation implementation. We'll need to figure out how to handle registers later. P.S. RISC-V has the weirdest relocations so far. /proc/kcore also appears to be broken. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-19 11:51:23 -07:00
Omar Sandoval	d27204260e	libdrgn: add stub Arm architecture with relocation implementation The only relocation type I saw in Debian's kernel module debug info was R_ARM_ABS32. R_ARM_REL32 is easy. The Linux kernel supports a bunch of other ones that don't seem relevant to debug info. Unfortunately, I wasn't able to test this because /proc/kcore doesn't exist on Arm. This apparently goes all the way back to 2003: https://lwn.net/Articles/45315/. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-19 00:25:05 -07:00
Omar Sandoval	3f246f7054	libdrgn: add stub AArch64 architecture with relocation implementation The only relocation types I saw in Debian's kernel module debug info were R_AARCH64_ABS64 and R_AARCH64_ABS32. R_AARCH64_ABS16, R_AARCH64_PREL64, R_AARCH64_PREL32, and R_AARCH64_PREL16 are all easy. The remaining types supported by the Linux kernel are for movw and immediate instructions, which aren't relevant to debug info. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-19 00:23:56 -07:00
Omar Sandoval	7535838cd5	libdrgn: add stub i386 architecture with relocation implementation The only relocation type I saw in Debian's kernel module debug info was R_386_32. R_386_PC32 is easy. The Linux kernel also supports R_386_PLT32, but that's the same story as R_X86_64_PLT32 in x86-64, so we don't implement it for now. I was torn between naming it i386, x86, or IA-32. x86 isn't immediately clear whether x86-64 is included or not. No one other than Intel calls it IA-32. i386 might incorrectly imply that it is strictly the original i386 instruction set with no later extensions, but the more general meaning is used frequently in the Linux world (e.g., Debian and QEMU both call it i386), so I went with that in the end. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-19 00:21:59 -07:00
Omar Sandoval	03f9f339e5	libdrgn: ppc64: add relocation implementation One of the biggest things we depend on libdwfl for is applying relocations on architectures other than x86-64. I'm exploring the possibility of removing the libdwfl dependency, so I'm going to add relocation implementations for more architectures, starting with ppc64. R_PPC64_ADDR32 and R_PPC64_ADDR64 were the only ones I saw in Debian's kernel module debug info. R_PPC64_REL32 and R_PPC64_REL64 are straightforward. The Linux kernel also implements R_PPC64_TOC, which don't seem relevant to debugging information, and R_PPC64_REL24 and R_PPC64_REL16, which I'd prefer to have a real example of. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-18 17:56:37 -07:00
Omar Sandoval	da16a12fad	libdrgn: x86_64: implement more relocation types Implement R_X86_64_32S and R_X86_64_PC64. I haven't seen these for debug info in the wild, but they're supported by the Linux kernel and they're easy to support. The only other type of relocation currently supported by the kernel is R_X86_64_PLT32, which is trickier. For kernel modules, it's equivalent to R_X86_64_PC32 (see Linux kernel commit b21ebf2fb4cd ("x86: Treat R_X86_64_PLT32 as R_X86_64_PC32"), but that doesn't seem to be true in general. It doesn't seem applicable to debug info sections, so hopefully we don't need to worry about it. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-18 17:56:37 -07:00
Omar Sandoval	b16dad8a36	libdrgn: support SHT_REL relocations In preparation for supporting ELF relocations for more architectures, generalize ELF relocations to handle SHT_REL sections/ElfN_Rel. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-04-18 17:56:37 -07:00

1 2 3 4 5 ...

624 Commits