JakeHillion/drgn

mirror of https://github.com/JakeHillion/drgn.git synced 2024-12-23 17:53:07 +00:00

Author	SHA1	Message	Date
Omar Sandoval	b05cc0eb75	libdrgn: use libkdumpfile for ELF vmcores when available vmcores don't include program headers for special memory regions like vmalloc and percpu. Instead, we need to walk the kernel page table to map those addresses. Luckily, libkdumpfile already does that. So, if drgn was built with libkdumpfile support, use it for ELF vmcores. Also add an environment variable to override this behavior. Closes #15.	2019-10-02 17:15:36 -07:00
Omar Sandoval	191c5ae253	libelf: clean up SHF_COMPRESSED handling We don't need the ifdef anymore since we're using the elf.h from our local elfutils. We can also fold a leftover nested if.	2019-09-24 17:16:17 -07:00
Omar Sandoval	ca9cdc1991	libdrgn: autogenerate docstrings.h I didn't want to use BUILT_SOURCES before because that would break make $TARGET. But, now that doesn't work anyways because we're using SUBDIRS, so we might as well use BUILT_SOURCES.	2019-09-19 11:08:04 -07:00
Omar Sandoval	aa4bfd646f	libdrgn: simplify gen_constants.py header search Instead of passing in a directory for header files, add -iquote for that directory.	2019-09-19 11:08:04 -07:00
Omar Sandoval	6a13d74c0c	libdrgn: build with bundled elfutils Now that we have the bundled version of elfutils, build it from libdrgn and link to it. We can also get rid of the elfutils version checks from the libdrgn code.	2019-09-19 11:07:12 -07:00
Omar Sandoval	1cedca8ff4	Import elfutils Based on: c950e8a9 config: Fix spec file, add manpages and new GFDL license. With the following patches: configure: Add --disable-programs configure: Add --disable-shared configure: Fix -D_FORTIFY_SOURCE=2 check when CFLAGS contains -Wno-error libcpu: compile i386_lex.c with -Wno-implicit-fallthrough The plan is to stop relying on the distribution's version of elfutils and instead ship our own. This gives us freedom to assume that we're using the latest version and even ship our own patches (starting with a few build system improvements). More details are in scripts/update-elfutils.sh, which was used to generate this commit.	2019-09-05 01:04:33 -07:00
Omar Sandoval	f11a8766bf	setup.py: get list of source files from git Currently, we have a special Makefile target to output the files for a libdrgn source tarball, and we use that for setuptools. However, the next change is going to import elfutils, and it'd be a pain to add the same thing for the elfutils sources. Instead, let's just use git ls-files for everything. The only difference is that source distributions won't have the autoconf/automake output.	2019-09-03 17:19:02 -07:00
Omar Sandoval	a3f4fe0518	libdrgn: handle get_debug_sections() errors per-module There's no reason to fail indexing just because one file is missing debug information.	2019-08-29 12:26:40 -07:00
Omar Sandoval	62d98b3016	libdrgn: fold ELF relocation code into dwarf_index I started with drgn_elf_relocator as a separate interface to parallelize by relocation. However, the final result is parallelized by file, which means that it can be done as part of the main read_cus() loop. Get rid of the elf_relocator interface and do it in dwarf_index.c instead. This means that if/when libdwfl gets faster at ELF relocations, we can rip out the relocation code without any other changes.	2019-08-29 12:26:22 -07:00
Omar Sandoval	698991b27b	Get rid of DRGN_ERROR_{ELF,DWARF}_ERROR and FileFormatError We're too inconsistent with how we use these for them to be useful (and it's impossible to distinguish between a format error and some other error from libelf/libdw/libdwfl), so let's just get rid of them and make it all DRGN_ERROR_OTHER/Exception.	2019-08-15 15:03:42 -07:00
Omar Sandoval	10142f922f	Add basic stack trace support For now, we only support stack traces for the Linux kernel (at least v4.9) on x86-64, and we only support getting the program counter and corresponding function symbol from each stack frame.	2019-08-02 00:26:28 -07:00
Serapheim Dimitropoulos	93d7ea9f01	Add support for kdump-compressed core dumps with libkdumpfile	2019-08-02 00:20:16 -07:00
Omar Sandoval	690b5fd650	libdrgn: generalize architecture to platform For stack trace support, we'll need to have some architecture-specific functionality. drgn's current notion of an architecture doesn't actually include the instruction set architecture. This change expands it to a "platform", which includes the ISA as well as the existing flags.	2019-08-02 00:11:56 -07:00
Omar Sandoval	71e6744210	libdrgn: add symbol table interface Now that we're not overloading the name "symbol", we can define struct drgn_symbol as a symbol table entry. For now, this is very minimal: it's just a name, address, and size. We can then add a way to find the symbol for a given address, drgn_program_find_symbol(). For now, this is only supported through the actual ELF symbol tables. However, in the future, we can probably support adding "symbol finders".	2019-07-30 09:25:34 -07:00
Omar Sandoval	0c5df56fba	libdrgn: replace symbol index with object index struct drgn_symbol doesn't really represent a symbol; it's just an object which hasn't been fully initialized (see `c2be52dff0` ("libdrgn: rename object index to symbol index"), it used to be called a "partial object"). For stack traces, we're going to have a notion of a symbol that more closely represents an ELF symbol, so let's get rid of the temporary struct drgn_symbol representation and just return an object directly.	2019-07-29 17:04:47 -07:00
Omar Sandoval	74bd59e38a	libdrgn: python: get rid of Program._symbol() We can test with Program.object() just as easily, so get rid of this undocumented method.	2019-07-29 17:04:47 -07:00
Omar Sandoval	62ff4e1dba	libdrgn: indicate finder lookup failure with special error Currently, finders indicate a non-fatal lookup error by setting the type member to NULL. This won't work when we replace the symbol finder with an object finder (which shouldn't modify the object on failure). Instead, use a static error for this purpose.	2019-07-29 17:04:47 -07:00
Omar Sandoval	0cb77b303c	libdrgn: work around Clang __muloti4 again See `2dd14ad522` ("libdrgn: work around "undefined reference to '__muloti4'" when using Clang").	2019-07-29 17:03:45 -07:00
Omar Sandoval	b01d1a943f	libdrgn: python: make set_drgn_error() return void * It still always returns NULL, but now we can directly return from functions returning some PyObject subtype.	2019-07-28 00:58:36 -07:00
Omar Sandoval	0a74a610bc	libdrgn: python: only repr() one level of type members Currently, repr() of structure and union types goes arbitrarily deep (except for cycles). However, for lots of real-world types, this is easily deeper than Python's recursion limit, so we can't get a useful repr() at all: >>> repr(prog.type('struct task_struct')) Traceback (most recent call last): File "<console>", line 1, in <module> RecursionError: maximum recursion depth exceeded while getting the repr of an object Instead, only print one level of structure and union types.	2019-07-27 15:04:31 -07:00
Omar Sandoval	d63125f133	libdrgn: python: make Program.object() flags optional Default to FindObjectFlags.ANY.	2019-07-24 11:02:34 -07:00
Omar Sandoval	06cce1baa1	libdrgn: fix typo in drgn_enomem documentation	2019-07-22 17:23:27 -07:00
Omar Sandoval	3e95e88028	libdrgn: vector: protect against overflow when doubling capacity It seems extremely unlikely that we'd actually overflow before we run out of memory, but let's just be safe.	2019-07-19 09:27:20 -07:00
Omar Sandoval	27a27940bc	libdrgn: split up drgn_program_get_dwarf() We don't need to get the DWARF index at the time we get the Dwfl handle, so get rid of drgn_program_get_dwarf(), add drgn_program_get_dwfl(), and create the DWARF index right before we update in a new function, drgn_program_update_dwarf_index().	2019-07-19 09:26:30 -07:00
Omar Sandoval	a17215e984	libdrgn: dwarf_index: fix memcpy() undefined behavior Apparently, it's undefined behavior to pass NULL as the source to memcpy(), even if the length is zero. It's an easy fix, so let's appease UBSan.	2019-07-15 12:27:48 -07:00
Omar Sandoval	1d4854a5bc	libdrgn: implement optimized x86-64 ELF relocations After the libdwfl conversion, we apply ELF relocations with libdwfl instead of our homegrown implementation. However, libdwfl is much slower at it than the previous implementation. We can work around this by (again) applying ELF relocations ourselves for architectures that we care about (x86-64, to start). For other architectures, we can fall back to libdwfl. This new implementation of ELF relocation reworks the parallelization to be per-file rather than per-relocation. The latter was done originally because before commit `6f16ab09d6` ("libdrgn: only apply ELF relocations to relocatable files"), we applied relocations to vmlinux, which is much larger than most kernel modules. Now that we don't do that, it seems to be slightly faster to parallelize by file.	2019-07-15 12:27:48 -07:00
Omar Sandoval	e5874ad18a	libdrgn: use libdwfl libdwfl is the elfutils "DWARF frontend library". It has high-level functionality for looking up symbols, walking stack traces, etc. In order to use this functionality, we need to report our debugging information through libdwfl. For userspace programs, libdwfl has a much better implementation than drgn for automatically finding debug information from a core dump or PID. However, for the kernel, libdwfl has a few issues: - It only supports finding debug information for the running kernel, not vmcores. - It determines the vmlinux address range by reading /proc/kallsyms, which is slow (~70ms on my machine). - If separate debug information isn't available for a kernel module, it finds it by walking /lib/modules/$(uname -r)/kernel; this is repeated for every module. - It doesn't find kernel modules with names containing both dashes and underscores (e.g., aes-x86_64). Luckily, drgn already solved all of these problems, and with some effort, we can keep doing it ourselves and report it to libdwfl. The conversion replaces a bunch of code for dealing with userspace core dump notes, /proc/$pid/maps, and relocations.	2019-07-15 12:27:48 -07:00
Omar Sandoval	a9a2cb7cac	libdrgn: dwarf_index: move bswap from file to compilation unit Remove an indirection.	2019-07-15 12:27:38 -07:00
Omar Sandoval	1c9ab2e7d1	libdrgn: dwarf_index: fix leak of DWARF index entries on failure We're forgetting to unchain new entries which are chained on old entries.	2019-07-15 12:27:36 -07:00
Omar Sandoval	996d3094ef	libdrgn: dwarf_index: fold unindex_files() into index_cus()	2019-07-15 12:27:33 -07:00
Omar Sandoval	b7e1b6ede6	libdrgn: dwarf_index: rename drgn_dwarf_index_iterator_next() output parameter	2019-07-15 12:27:24 -07:00
Omar Sandoval	d423361d8a	libdrgn: dwarf_index: move .debug_str null-termination check Check it right after we read the section instead of when updating the index.	2019-07-15 12:27:18 -07:00
Omar Sandoval	9f9bec4762	libdrgn: use common vector where applicable This converts several open-coded dynamic arrays to the new common vector implementation: - drgn_lexer stack - Array dimension array for DWARF parsing - drgn_program_read_c_string() - DWARF index directory name hashes - DWARF index file name hashes - DWARF index abbreviation table - DWARF index shard entries	2019-07-15 12:27:16 -07:00
Omar Sandoval	8d52536271	libdrgn: add common vector implementation drgn has enough open-coded dynamic arrays at this point to warrant a common implementation. Add one inspired by hash_table.h. The API is pretty minimal. I'll add more to it as the need arises.	2019-07-15 12:27:15 -07:00
Omar Sandoval	e2a27e7f59	libdrgn: add drgn_error_format_os() There are some cases where we format a path (e.g., with asprintf()) and keep it around only in case of errors. Add drgn_error_format_os() so we can just reformat it if we hit the error, which simplifies cleanup.	2019-07-11 16:19:18 -07:00
Omar Sandoval	74c0aa8612	libdrgn: reorder drgn_error_create_os() arguments To make it more consistent with the upcoming drgn_error_format_os().	2019-07-11 16:12:56 -07:00
Omar Sandoval	ce808440f7	libdrgn: move string_builder_line_break() to string_builder.c	2019-07-11 15:33:10 -07:00
Omar Sandoval	0ebcfc8178	libdrgn: use drgn_stop error in DWARF index The (struct drgn_error *)-1 hack predates DRGN_ERROR_STOP. Get rid of the hack.	2019-07-11 10:53:13 -07:00
Omar Sandoval	5b2da5e682	libdrgn: get rid of unnecessary gelf_getshdr() in read_elf_section()	2019-07-11 09:53:56 -07:00
Omar Sandoval	e73346b488	libdrgn: generalize IS_RUNNING_KERNEL flag to IS_LIVE I.e., also flag running processes as live.	2019-07-08 16:55:54 -07:00
Omar Sandoval	129f1493b8	libdrgn: split kernel-specific stuff out of program.c Almost half of program.c is stuff specific to the Linux kernel, so let's separate that out (and combine it with the existing kernel module code).	2019-07-08 16:53:58 -07:00
Omar Sandoval	a0fc02efd3	libdrgn: don't store version in struct compilation_unit We don't use it after checking it.	2019-07-08 16:23:38 -07:00
Omar Sandoval	8a59a7e819	libdrgn: don't preallocate DWARF index memory This doesn't make things any faster in my benchmarks, and it complicates DWARF index initialization.	2019-07-08 16:23:38 -07:00
Omar Sandoval	ec33f9bf73	libdrgn: get rid of DWARF index flags We always index everything, so simplify the code a bit.	2019-07-08 16:23:38 -07:00
Omar Sandoval	426ee1e57c	libdrgn/python: add missing name in Symbol argument parsing	2019-06-29 01:38:36 -07:00
Omar Sandoval	97ebc2a57c	libdrgn/python: add Program.cache For caching metadata between invocations of helpers (e.g., detected kernel version differences or config options).	2019-06-28 16:15:07 -07:00
Omar Sandoval	25e7a9d3b8	libdrgn/python: implement Program.__contains__	2019-06-28 16:02:52 -07:00
Omar Sandoval	f55158c74c	libdrgn: add PAGE_{SHIFT,SIZE,MASK} symbols from vmcoreinfo Since we currently don't parse DWARF macro information, there's no easy way to get the value PAGE_SIZE and friends in drgn. However, vmcoreinfo contains the value of PAGE_SIZE, so let's add a special symbol finder that returns that.	2019-05-29 00:02:48 -07:00
Omar Sandoval	1614b1e6f6	libdrgn: add better vmcoreinfo fallback Currently, if we don't get vmcoreinfo from /proc/kcore, and we can't get it from /sys/kernel/vmcoreinfo, then we manually determine the kernel release and KASLR offset. This has a couple of issues: 1. We look for vmlinux to determine the KASLR offset, which may not be in a standard location. 2. We might want to start using other information from vmcoreinfo which can't be determined as easily. Instead, we can get the virtual address of vmcoreinfo from /proc/kallsyms and read it directly from there.	2019-05-28 15:54:49 -07:00
Omar Sandoval	8e45a305fb	libdrgn: fix file/memory leak in proc_kallsyms_symbol_addr()	2019-05-28 15:01:35 -07:00

1 2 3

115 Commits