JakeHillion/drgn

mirror of https://github.com/JakeHillion/drgn.git synced 2024-12-24 02:03:05 +00:00

Author	SHA1	Message	Date
Omar Sandoval	2345325ac1	drgndoc: handle implicit classmethods The __init_subclass__ and __class_getitem__ methods are always class methods even if not decorated as such, so format them accordingly. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-08-11 23:18:42 -07:00
Omar Sandoval	b8aa2dcfc5	drgndoc: format None, True, and False as roles Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-08-11 22:51:56 -07:00
Omar Sandoval	be85631471	travis: fix spurious VM crashes Every few builds or so, a vmtest VM crashes after printing "x86: Booting SMP configuration:". After some difficult debugging, I determined that the crash happens in arch/x86/realmode/rm/trampoline_64.S (the code that initializes secondary CPUs) at the ljmp from startup_32 to startup_64. The real problem happens earlier in startup_32: movl $pa_trampoline_pgd, %eax movl %eax, %cr3 Sometimes, the store to CR3 "fails" and CR3 remains zero, which causes the later ljmp to triple fault. This can be reproduced by the following script: #!/bin/sh curl -L 'https://www.dropbox.com/sh/2mcf2xvg319qdaw/AABFKsISWRpndNZ1gz60O-qSa/x86_64/vmlinuz-5.8.0-rc7-vmtest1?dl=1' -o vmlinuz cat > commands.gdb << "EOF" set confirm off target remote :1234 # arch/x86/realmode/rm/trampoline_64.S:startup_32 after CR3 store. hbreak 0x9ae09 if $cr3 == 0 command info registers eax cr3 quit 1 end # kernel/smp.c:smp_init() after all CPUs have been brought up. If we get here, # the bug wasn't triggered. hbreak 0xffffffff81ed4484 command kill quit 0 end continue EOF while true; do qemu-system-x86_64 -cpu host -enable-kvm -smp 64 -m 128M \ -nodefaults -display none -serial file:/dev/stdout -no-reboot \ -kernel vmlinuz -append 'console=0,115200 panic=-1 nokaslr' \ -s -S & gdb -batch -x commands.gdb \|\| exit 1 done This seems to be a problem with nested virtualization that was fixed by Linux kernel commit b4d185175bc1 ("KVM: VMX: give unrestricted guest full control of CR3") (in v4.17). Apparently, the Google Cloud hosts that Travis runs on are missing this fix. We obviously can't patch those hosts, but we can work around it. Disabling unrestricted guest support in the Travis VM causes CR3 stores in the nested vmtest VM to be emulated, bypassing the bug. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-08-04 16:36:09 -07:00
Omar Sandoval	20bcde1f1d	drgn 0.0.7 Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-27 23:32:32 -07:00
Omar Sandoval	9f3fadd3de	README: use code-block instead of highlight PyPI's RST parser apparently doesn't know the highlight directive, which snuck into the README in commit `4de147e478` ("Add CONTRIBUTING.rst"). Use code-block instead. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-27 23:28:23 -07:00
Omar Sandoval	025989871b	drgn 0.0.6 Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-27 17:25:54 -07:00
Omar Sandoval	e3309765f9	helpers: add kaslr_offset() and move pgtable_l5_enabled() Make the KASLR offset available to Python in a new drgn.helpers.linux.boot module, and move pgtable_l5_enabled() there, too. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-27 17:00:16 -07:00
Omar Sandoval	d118fda740	vmtest: check that downloaded file is not truncated My work VPN is apparently closing HTTP connections prematurely, which exposed that urllib won't catch incomplete reads if copied through shutil.copyfileobj(). Check it explicitly. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-27 12:19:46 -07:00
Omar Sandoval	d45aafe43c	vmtest: manage: remove 3.16 blacklist 3.16 is EOL and no longer included in the list of releases. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-27 12:00:35 -07:00
Omar Sandoval	27e73fb84b	docs: fix broken link to drgn.h drgn.h is generated from drgn.h.in since commit `d60c6a1d68` ("libdrgn: add register information to platform"). Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-27 11:56:36 -07:00
Omar Sandoval	e7f353c118	libdrgn: hash_table: clean up coding style Clean up the coding style of the remaining few places that the last couple of changes didn't rewrite. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-18 11:53:05 -07:00
Omar Sandoval	f94b0262c6	libdrgn: hash_table: implement vector storage policy The folly F14 implementation provides 3 storage policies: value, node, and vector. The default F14FastMap/F14FastSet chooses between the value and vector policies based on the value size. We currently only implement the value policy, as the node policy is easy to emulate and the vector policy would've added more complexity. This adds support for the vector policy (adding even more C abuse :) and automatically chooses the policy the same way as folly. It'd be easy to add a way to choose the policy if needed. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-18 11:53:00 -07:00
Omar Sandoval	9ea11a7c26	libdrgn: hash_table: port reserve optimization The only major change to the folly F14 implementation since I originally ported it is commit 3d169f4365cf ("memory savings for F14 tables with explicit reserve()"). That is a small improvement for small tables and a large improvement for vector tables, which are about to be added. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-18 01:42:37 -07:00
Omar Sandoval	2eab47ce9e	libdrgn: hash_table: use posix_memalign() instead of aligned_alloc() posix_memalign() doesn't have the restriction that the size must be a multiple of the alignment like aligned_alloc() does in C11. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-18 01:42:37 -07:00
Omar Sandoval	2409868409	libdrgn: hash_table: define chunk alignment constant Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-18 01:42:37 -07:00
Omar Sandoval	209eaee485	setup.py: import setuptools before distutils setuptools recently started warning if distutils is imported before it. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-18 01:41:12 -07:00
Omar Sandoval	6d4af7e17e	libdrgn: dwarf_info_cache: handle variables DW_AT_const_value Compile-time constants have DW_AT_const_value instead of DW_AT_location. We can translate those to a value object. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-13 15:23:51 -07:00
Omar Sandoval	213c148ce6	libdrgn: dwarf_info_cache: handle DW_AT_endianity Variables can have a non-default endianity. Handle it and clean up variable endian handling. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-13 14:26:58 -07:00
Omar Sandoval	c840072d05	libdrgn: make drgn_object_set_buffer() take a void * It's awkward to make callers cast to char *. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-13 10:25:03 -07:00
Omar Sandoval	f1eaf5b14c	libdrgn: add load_debug_info example program Really it's more of a test program than an example program. It's useful for benchmarking, testing with valgrind, etc. It's not built by default, but it can be built manually with: $ make -C build/temp.* examples/load_debug_info And run with: $ ./build/temp.*/examples/load_debug_info Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-10 16:18:58 -07:00
Omar Sandoval	3028da4d1d	libdrgn: compare language in drgn_type_eq() Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-08 22:07:49 -07:00
Omar Sandoval	1409b56d24	travis.yml: remove unnecessary sudo from echo	2020-07-08 18:39:37 -07:00
Omar Sandoval	27744108e1	setup.py: add 5.8 to vmtest kernels	2020-07-08 18:39:37 -07:00
Omar Sandoval	95be142d17	tests: disable THREAD_SIZE test GCC 10 doesn't generate a DIE for union thread_union, which breaks our THREAD_SIZE object finder. The previous change removed our internal dependency on THREAD_SIZE, so disable this test while I investigate why GCC changed.	2020-07-08 18:36:10 -07:00
Omar Sandoval	1b47b866b4	libdrgn: go back to trusting PRSTATUS PID Commit `eea5422546` ("libdrgn: make Linux kernel stack unwinding more robust") overlooked that if the task is running in userspace, the stack pointer in PRSTATUS obviously won't match the kernel stack pointer. Let's bite the bullet and use the PID. If the race shows up in practice, we can try to come up with another workaround.	2020-07-08 18:34:16 -07:00
Omar Sandoval	4de147e478	Add CONTRIBUTING.rst This documents best practices for contributing to drgn. We now require a DCO sign-off. Also clean up some related areas in the documentation. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-07-07 17:44:02 -07:00
Omar Sandoval	293418294a	libdrgn: assume compiler uses sane integer implementation I once tried to implement a generic arithmetic right shift macro without relying on any implementation-defined behavior, but this turned out to be really hard. drgn is fairly tied to GCC and GCC-compatible compilers (like Clang), so let's just assume GCC's model [1]: modular conversion to signed types, two's complement signed bitwise operators, and sign extension for signed right shift. 1: https://gcc.gnu.org/onlinedocs/gcc/Integers-implementation.html	2020-07-07 17:18:17 -07:00
Omar Sandoval	948cda2941	libdrgn: add vector/hash table initializers and update coding style Declaring a local vector or hash table and separately initializing it with vector_init()/hash_table_init() is annoying. Add macros that can be used as initializers. This exposes several places where the C89 style of placing all declarations at the beginning of a block is awkward. I adopted this style from the Linux kernel, which uses C89 and thus requires this style. I'm now convinced that it's usually nicer to declare variables where they're used. So let's officially adopt the style of mixing declarations and code (and ditch the blank line after declarations) and update the functions touched by this change.	2020-07-01 12:48:24 -07:00
Omar Sandoval	e4c52c5422	libdrgn: linux_kernel: use names for kmod index constants This makes it much easier to follow along with the code and understand the format.	2020-06-30 15:14:21 -07:00
Omar Sandoval	03d8cb0e32	libdrgn: fix hash_pair_from_non_avalanching_hash() on 64-bit without SSE 4.2 We were forgetting to mask away the extra bits. There are two places that we use the tag without converting it to a uint8_t: hash_table_probe_delta(), which is mostly benign since we mask it by the chunk mask anyways; and table_chunk_match() without SSE 2, which completely breaks. While we're here, let's align the comments better.	2020-06-24 13:33:08 -07:00
Omar Sandoval	8e7c1f1009	drgn 0.0.5	2020-05-26 10:04:06 -07:00
Omar Sandoval	a227d0d50e	Update elfutils and revert activation frame patch After thinking about it some more, I realized that "libdwfl: simplify activation frame logic" breaks the case where during unwinding someone queries isactivation for reasons other than knowing whether to decrement program counter. Revert the patch and refactor "libdwfl: add interface for getting Dwfl_Module and Dwarf_Frame for Dwfl_Frame" to handle it differently. Based on: c95081596 size: Also obey radix printing for bsd format. With the following patches: configure: Add --disable-programs configure: Add --disable-shared libdwfl: add interface for attaching to/detaching from threads libdwfl: export __libdwfl_frame_reg_get as dwfl_frame_register libdwfl: add interface for getting Dwfl_Module and Dwarf_Frame for Dwfl_Frame libdwfl: add interface for evaluating DWARF expressions in a frame	2020-05-20 13:38:49 -07:00
Omar Sandoval	eea5422546	libdrgn: make Linux kernel stack unwinding more robust drgn has a couple of issues unwinding stack traces for kernel core dumps: 1. It can't unwind the stack for the idle task (PID 0), which commonly appears in core dumps. 2. It uses the PID in PRSTATUS, which is racy and can't actually be trusted. The solution for both of these is to look up the PRSTATUS note by CPU instead of PID. For the live kernel, drgn refuses to unwind the stack of tasks in the "R" state. However, the "R" state is running or runnable, so in the latter case, we can still unwind the stack. The solution for this is to look at on_cpu for the task instead of the state.	2020-05-20 12:03:00 -07:00
Omar Sandoval	146930aff8	libdrgn: replace arch frame_registers with callbacks We currently unwind from pt_regs and NT_PRSTATUS using an array of register definitions. It's more flexible and more efficient to do this with an architecture-specific callback. For x86-64, this change also makes us depend on the binary layout rather than member names of struct pt_regs, but that shouldn't matter unless people are defining their own, weird struct pt_regs.	2020-05-19 17:11:27 -07:00
Omar Sandoval	4d8597f0f8	libdrgn: add THREAD_SIZE to Linux kernel object finder Despite the naming, this is the kernel stack size.	2020-05-19 17:10:54 -07:00
Omar Sandoval	971a2d3687	libdrgn/python: make Objects fully immutable The model has always been that drgn Objects are immutable, but for some reason I went through the trouble of allowing __init__() to reinitialize an already initialized Object. Instead, let's fully initialize the Object in __new__() and get rid of __init__().	2020-05-18 00:07:49 -07:00
Omar Sandoval	ab876f3dbd	libdrgn/python: allow specifying Object value positionally It's annoying to have to do value= when creating objects, especially in interactive mode. Let's allow passing in the value positionally so that `Object(prog, "int", value=0)` becomes `Object(prog, "int", 0)`. It's clear enough that this is creating an int with value 0.	2020-05-18 00:07:49 -07:00
Omar Sandoval	1cc771605d	tests: run black on stray change	2020-05-15 15:19:11 -07:00
Omar Sandoval	8b264f8823	Update copyright headers to Facebook and add missing headers drgn was originally my side project, but for awhile now it's also been my work project. Update the copyright headers to reflect this, and add a copyright header to various files that were missing it.	2020-05-15 15:13:02 -07:00
Omar Sandoval	c339113f9c	libdrgn: adjust program counter when looking up frame symbol For functions that call a noreturn function, the compiler may omit code after the call instruction. This means that the return address may not lie in the caller's symbol. dwfl_frame_pc() returns whether a frame is an "activation", i.e., its program counter is guaranteed to lie within the caller. This is only the case for the initial frame, frames interrupted by a signal, and the signal trampoline frame. For everything else, we need to decrement the program counter before doing any lookups.	2020-05-13 17:11:54 -07:00
Omar Sandoval	175f83fc23	Update elfutils with noreturn unwinding fix Rebase on master and fix dwfl_frame_module/dwfl_frame_dwarf_frame to decrement the program counter when necessary. Based on: a8493c12a libdw: Skip imported compiler_units in libdw_visit_scopes walking DIE tree With the following patches: configure: Add --disable-programs configure: Add --disable-shared libdwfl: simplify activation frame logic libdwfl: add interface for attaching to/detaching from threads libdwfl: add interface for getting Dwfl_Module and Dwarf_Frame for Dwfl_Frame libdwfl: export __libdwfl_frame_reg_get as dwfl_frame_register libdwfl: add interface for evaluating DWARF expressions in a frame	2020-05-13 16:41:52 -07:00
Omar Sandoval	bf545105c6	libdrgn: build in silent mode by default The automake/libtool compilation output is obnoxiously verbose. Switch on automake's silent mode, and make the custom rules honor it.	2020-05-10 00:12:50 -07:00
Omar Sandoval	62ddc96350	setup.py: add 5.7 to vmtest kernels 5.7 is up to rc4 (oops). Better late than never.	2020-05-08 18:13:12 -07:00
Omar Sandoval	2d1481f5ab	libdrgn: add page table walker kernel memory reader Now that we can walk page tables, we can use it in a memory reader that reads kernel memory via the kernel page table. This means that we don't need libkdumpfile for ELF vmcores anymore (although I'll keep the functionality around until this code has been validated more).	2020-05-08 17:37:56 -07:00
Omar Sandoval	e697be707c	libdrgn: use swapper_pg_dir in vmcoreinfo for fallback PAGE_OFFSET I originally wanted to avoid depending on another vmcoreinfo field, but an the next change is going to depend on swapper_pg_dir in vmcoreinfo anyways, and it ends up being simpler to use it.	2020-05-08 17:37:56 -07:00
Omar Sandoval	4b82d1e075	helpers: add cmdline() and environ() These are two of the most common use cases for reading a process's memory.	2020-05-08 17:37:56 -07:00
Omar Sandoval	8a276838ac	helpers: add access_process_vm() and access_remote_vm() Now that we can walk page tables, we can finally read memory from userspace tasks. Closes #53.	2020-05-08 17:37:01 -07:00
Omar Sandoval	d0a1718451	libdrgn: implement virtual address translation/page table walking There are a few big use cases for this in drgn: * Helpers for accessing memory in the virtual address space of userspace tasks. * Removing the libkdumpfile dependency for vmcores. * Handling gaps in the virtual address space of /proc/kcore (cf. #27). I dragged my feet on implementing this because I thought it would be more complicated, but the page table layout on x86-64 isn't too bad. This commit implements page table walking using a page table iterator abstraction. The first thing we'll add on top of this will be a helper for reading memory from a virtual address space, but in the future it'd also be possible to export the page table iterator directly.	2020-05-08 17:36:19 -07:00
Omar Sandoval	63299e0701	libdrgn: actually use uint64_t for two's complement unary ops UNARY_OP_SIGNED_2C() uses a union of int64_t and uint64_t to avoid signed integer overflow... except that there's a typo and the uint64_t is actually an int64_t. Fix it and add a test that would catch it with -fsanitize=undefined.	2020-05-08 13:50:24 -07:00
Omar Sandoval	8f81ea255f	libdrgn: don't use unaligned loads to parse DWARF -fsanitize=undefined reports that the read_u* helpers rely on unaligned loads. Use memcpy() instead.	2020-05-08 13:50:24 -07:00

1 2 3 4 5 ...

808 Commits