JakeHillion/drgn

mirror of https://github.com/JakeHillion/drgn.git synced 2024-12-26 02:25:36 +00:00

Author	SHA1	Message	Date
Omar Sandoval	d3f4f2b017	tests: create loop device for block helper tests The upcoming vmtest rework won't have any block devices, so let's add a loop device so that we always have a device to test with.	2020-03-27 10:09:41 -07:00
Omar Sandoval	79f973007b	libdrgn/python: fix reference counting on Object.type_ We need to keep the Program alive for its types to stay valid, not just the objects the Program has pinned. (I have no idea why I changed this in commit `565e0343ef` ("libdrgn: make symbol index pluggable with callbacks").)	2020-03-13 16:05:43 -07:00
Jay Kamat	3f870603fa	libdrgn: add default language to drgn_program For operations where we don't have a type available, we currently fall back to C. Instead, we should guess the language of the program and use that as the default. The heurisitic implemented here gets the language of the CU containing "main" (except for the Linux kernel, which is always C). In the future, we should allow manually overriding the automatically determined language.	2020-02-26 19:55:42 -08:00
Jay Kamat	6c264b0eae	libdrgn: add language to struct drgn_type For types obtained from DWARF, we determine it from the language of the CU. For other types, it can be specified manually or fall back to the default (C). Then, we can use the language for operations where the type is available.	2020-02-26 19:55:42 -08:00
Omar Sandoval	fe42a71116	Add DW_LANG to generated dwarf.py While we're here, make generate_dwarf_constants.py use the bundled dwarf.h, generate code that black is happy with, and use the keyword list from the standard library.	2020-02-26 19:55:42 -08:00
Omar Sandoval	a5cd92f24e	libdrgn: make vmcoreinfo accessible before loading debug info UTS_RELEASE is currently only accessible once debug info is loaded with prog.load_debug_info(main=True). This makes it difficult to get the release, find the appropriate vmlinux, then load the found vmlinux. We can add vmcoreinfo_object_find as part of set_core_dump(), which makes it possible to do the following: prog = drgn.Program() prog.set_core_dump(core_dump_path) release = prog['UTS_RELEASE'].string_() vmlinux_path = find_vmlinux(release) prog.load_debug_info([vmlinux_path]) The only downside is that this ends up using the default definition of char rather than what we would get from the debug info, but that shouldn't be a big problem.	2020-02-19 12:11:45 -08:00
Omar Sandoval	cc18d9e502	libdrgn: add UTS_RELEASE to vmcoreinfo_object_find The osrelease is accessible via init_uts_ns.name.release, but we can also get it straight out of vmcoreinfo, which will be useful for the next change. UTS_RELEASE is the name of the macro defined in the kernel.	2020-02-19 12:11:20 -08:00
Omar Sandoval	26ef465007	libdrgn/python: add proper type for members and parameters This continues the conversion from the last commit. Members and parameters are basically the same, so we can do them together. Unlike enumerators, these don't make sense to unpack or access as sequences.	2020-02-12 15:40:19 -08:00
Omar Sandoval	7c70a1a384	libdrgn/python: add proper type for enumerators Currently, type members, enumerators, and parameters are all represented by tuples in the Python bindings. This is awkward to document and implement. Instead, let's replace these tuples with proper types, starting with the easiest one, TypeEnumerator. This one still makes sense to treat as a sequence so that it can be unpacked as (name, value).	2020-02-12 15:37:41 -08:00
Jay Kamat	23c7d34099	helpers: Add get_config helper for getting kconfig map	2020-02-12 14:06:49 -08:00
Omar Sandoval	9de2cc8410	libdrgn/python: make Object.__index__() TypeError message clearer Currently, we print: >>> prog.symbol(prog['init_task']) Traceback (most recent call last): File "<console>", line 1, in <module> TypeError: cannot convert 'struct task_struct' to index It's not obvious what it means to convert to an index. Instead, let's use the error message raised by operator.index(): TypeError: 'struct task_struct' object cannot be interpreted as an integer	2020-02-11 09:19:53 -08:00
Omar Sandoval	4adc691622	helpers: add helpers for finding user_structs	2020-02-10 18:11:25 -08:00
Omar Sandoval	b5232d944d	tests: fix skipping Linux kernel tests if missing debug info I forgot to name the caught exception.	2020-02-07 14:45:34 -08:00
Serapheim Dimitropoulos	ad82e9623a	Introduce OutOfBoundsError Decouple some of the responsibilities of FaultError to OutOfBoundsError so consumers can differentiate between invalid memory accesses and running out of bounds in drgn Objects which may be based on valid memory address.	2020-02-04 14:59:31 -08:00
Omar Sandoval	6ee17e8f19	tests: add tests for cgroup helpers	2020-01-14 16:03:20 -08:00
Omar Sandoval	016f2f43c6	tests: add tests for kernfs helpers	2020-01-14 16:03:20 -08:00
Omar Sandoval	7356816f61	helpers: get rid of get_tcp_states() After thinking about this some more, I decided that although it makes sense for scripts to convert a type to an IntEnum class, I'd prefer that the helpers take and return drgn Objects rather than these classes.	2020-01-14 14:25:32 -08:00
Omar Sandoval	660276a0b8	Format Python code with Black I'm not a fan of 100% of the Black coding style, but I've spent too much time manually formatting Python code, so let's just pull the trigger.	2020-01-14 11:51:58 -08:00
Omar Sandoval	370bf6f16a	tests: add tests for new net and tcp helpers Because the vmtest kernels aren't currently built with networking support, we need to skip them if TCP isn't supported.	2020-01-02 19:43:57 -05:00
Omar Sandoval	1443d17fb4	libdrgn: add DRGN_FORMAT_OBJECT_IMPLICIT_ELEMENTS	2019-12-19 11:43:54 -08:00
Omar Sandoval	db66952b2e	libdrgn: add DRGN_FORMAT_OBJECT_IMPLICIT_MEMBERS	2019-12-19 11:43:54 -08:00
Omar Sandoval	c8434e9a9e	libdrgn: add DRGN_FORMAT_OBJECT_ELEMENT_INDICES	2019-12-19 11:43:54 -08:00
Omar Sandoval	cfceb491db	libdrgn: add DRGN_FORMAT_OBJECT_MEMBER_NAMES	2019-12-19 11:43:54 -08:00
Omar Sandoval	4fad941ec1	libdrgn: add DRGN_FORMAT_OBJECT_{MEMBERS,ELEMENTS}_SAME_LINE	2019-12-19 11:43:54 -08:00
Omar Sandoval	6bb8da04a0	libdrgn: omit trailing comma when formatting one-line array This is somewhat arbitrary, but I think it looks more natural to only use the trailing comma for multi-line initializers.	2019-12-19 11:43:54 -08:00
Omar Sandoval	d77b7bd7e3	libdrgn: add DRGN_FORMAT_OBJECT_{TYPE_NAME,MEMBER_TYPE_NAMES,ELEMENT_TYPE_NAMES}	2019-12-19 11:43:54 -08:00
Omar Sandoval	89307c532a	libdrgn: add DRGN_FORMAT_OBJECT_CHAR	2019-12-19 11:43:54 -08:00
Omar Sandoval	7cee597fff	libdrgn: add DRGN_FORMAT_OBJECT_STRING	2019-12-19 11:43:54 -08:00
Omar Sandoval	5865fa4d16	libdrgn: add DRGN_FORMAT_OBJECT_SYMBOLIZE	2019-12-19 11:43:54 -08:00
Omar Sandoval	f58bc4bf3a	libdrgn: add DRGN_FORMAT_OBJECT_DEREFERENCE	2019-12-19 11:43:54 -08:00
Omar Sandoval	cf3a07bdfb	libdrgn: python: replace Object.__format__ with Object.format_ We'd like to have more control over how objects are formatted. I considered defining a custom string format specification syntax, but that's not easily discoverable. Instead, let's get rid of the current format specification support and replace it with a normal method.	2019-12-19 11:43:52 -08:00
Omar Sandoval	c3dbb3006d	tests: remove stray TODO comment I added this as a reminder to handle errno but forgot to remove the comment when I handled errno.	2019-12-16 11:20:22 -08:00
Omar Sandoval	40e509044c	Add tests for Linux helpers We currently have no test coverage for helpers. This is a problem, as they can be fairly complicated and are susceptible to breaking with new kernel versions. It's actually not too hard to test user-facing subsystems on the running kernel as long as we're root and have debug info for vmlinux, so let's add several tests for those. Specific data structures will be a little trickier to test, so for now I'm not covering those.	2019-11-22 16:38:52 -08:00
Amlan Nayak	0df2152307	Add basic class type support This implements the first step at supporting C++: class types. In particular, this adds a new drgn_type_kind, DRGN_TYPE_CLASS, and support for parsing DW_TAG_class_type from DWARF. Although classes are not valid in C, this adds support for pretty printing them, for completeness.	2019-11-18 10:36:40 -08:00
Omar Sandoval	d60c6a1d68	libdrgn: add register information to platform In order to retrieve registers from stack traces, we need to know what registers are defined for a platform. This adds a small DSL for defining registers for an architecture. The DSL is parsed by an awk script that generates the necessary tables, lookup functions, and enum definitions.	2019-10-18 14:33:02 -07:00
Omar Sandoval	b8c657d760	libdrgn: python: add sizeof() It's annoying to do obj.type_.size, and that doesn't even work for every type. Add sizeof() that does the right thing whether it's given a Type or Object.	2019-10-18 11:47:32 -07:00
Omar Sandoval	12b0214b4d	libdrgn: work around DW_AT_upper_bound of -1 for empty arrays For the following source code: int arr[] = {}; GCC emits the following DWARF: DWARF section [ 4] '.debug_info' at offset 0x40: [Offset] Compilation unit at offset 0: Version: 4, Abbreviation section offset: 0, Address size: 8, Offset size: 4 [ b] compile_unit abbrev: 1 producer (strp) "GNU C17 9.2.0 -mtune=generic -march=x86-64 -g" language (data1) C99 (12) name (strp) "test.c" comp_dir (strp) "/home/osandov" stmt_list (sec_offset) 0 [ 1d] array_type abbrev: 2 type (ref4) [ 34] sibling (ref4) [ 2d] [ 26] subrange_type abbrev: 3 type (ref4) [ 2d] upper_bound (sdata) -1 [ 2d] base_type abbrev: 4 byte_size (data1) 8 encoding (data1) signed (5) name (strp) "ssizetype" [ 34] base_type abbrev: 5 byte_size (data1) 4 encoding (data1) signed (5) name (string) "int" [ 3b] variable abbrev: 6 name (string) "arr" decl_file (data1) test.c (1) decl_line (data1) 1 decl_column (data1) 5 type (ref4) [ 1d] external (flag_present) yes location (exprloc) [ 0] addr .bss+0 <arr> Note the DW_AT_upper_bound of -1. We end up parsing this as UINT64_MAX and returning a "DW_AT_upper_bound is too large" error. It appears that GCC is simply emitting the array length minus one, so let's treat these as having a length of zero. Fixes #19.	2019-10-18 03:18:21 -07:00
Omar Sandoval	430732093d	libdrgn: python: add converter for byteorder Rather than open-coding the conversion where we need it, make it a proper converter function.	2019-10-15 21:21:21 -07:00
Omar Sandoval	55a9700435	libdrgn: python: accept integer-like arguments in more places There are a few places (e.g., Program.symbol(), Program.read()) where it makes sense to accept, e.g., a drgn.Object with integer type. Replace index_arg() with a converter function and use it everywhere that we use the "K" format for PyArg_Parse*.	2019-10-15 21:10:11 -07:00
Omar Sandoval	181ebe1a01	Add missing entries in drgn.__all__ PlatformFlags and PrimitiveType got squashed into one string because of a missing comma, and execscript was never added. Fix it and add some test cases for it.	2019-10-03 16:50:00 -07:00
Omar Sandoval	698991b27b	Get rid of DRGN_ERROR_{ELF,DWARF}_ERROR and FileFormatError We're too inconsistent with how we use these for them to be useful (and it's impossible to distinguish between a format error and some other error from libelf/libdw/libdwfl), so let's just get rid of them and make it all DRGN_ERROR_OTHER/Exception.	2019-08-15 15:03:42 -07:00
Omar Sandoval	690b5fd650	libdrgn: generalize architecture to platform For stack trace support, we'll need to have some architecture-specific functionality. drgn's current notion of an architecture doesn't actually include the instruction set architecture. This change expands it to a "platform", which includes the ISA as well as the existing flags.	2019-08-02 00:11:56 -07:00
Omar Sandoval	0c5df56fba	libdrgn: replace symbol index with object index struct drgn_symbol doesn't really represent a symbol; it's just an object which hasn't been fully initialized (see `c2be52dff0` ("libdrgn: rename object index to symbol index"), it used to be called a "partial object"). For stack traces, we're going to have a notion of a symbol that more closely represents an ELF symbol, so let's get rid of the temporary struct drgn_symbol representation and just return an object directly.	2019-07-29 17:04:47 -07:00
Omar Sandoval	74bd59e38a	libdrgn: python: get rid of Program._symbol() We can test with Program.object() just as easily, so get rid of this undocumented method.	2019-07-29 17:04:47 -07:00
Omar Sandoval	b5b024ecac	tests: move common helpers to top-level	2019-07-29 17:04:47 -07:00
Omar Sandoval	b01d1a943f	libdrgn: python: make set_drgn_error() return void * It still always returns NULL, but now we can directly return from functions returning some PyObject subtype.	2019-07-28 00:58:36 -07:00
Omar Sandoval	0a74a610bc	libdrgn: python: only repr() one level of type members Currently, repr() of structure and union types goes arbitrarily deep (except for cycles). However, for lots of real-world types, this is easily deeper than Python's recursion limit, so we can't get a useful repr() at all: >>> repr(prog.type('struct task_struct')) Traceback (most recent call last): File "<console>", line 1, in <module> RecursionError: maximum recursion depth exceeded while getting the repr of an object Instead, only print one level of structure and union types.	2019-07-27 15:04:31 -07:00
Omar Sandoval	67a16a09b8	tests: test that Python documentation renders A couple of times, I've broken help(drgn) by formatting a function signature in a way that the inspect module doesn't understand (namely, it crashes on Enum default arguments). Let's add a simple test that the documentation at least renders.	2019-07-24 11:01:35 -07:00
Omar Sandoval	e5874ad18a	libdrgn: use libdwfl libdwfl is the elfutils "DWARF frontend library". It has high-level functionality for looking up symbols, walking stack traces, etc. In order to use this functionality, we need to report our debugging information through libdwfl. For userspace programs, libdwfl has a much better implementation than drgn for automatically finding debug information from a core dump or PID. However, for the kernel, libdwfl has a few issues: - It only supports finding debug information for the running kernel, not vmcores. - It determines the vmlinux address range by reading /proc/kallsyms, which is slow (~70ms on my machine). - If separate debug information isn't available for a kernel module, it finds it by walking /lib/modules/$(uname -r)/kernel; this is repeated for every module. - It doesn't find kernel modules with names containing both dashes and underscores (e.g., aes-x86_64). Luckily, drgn already solved all of these problems, and with some effort, we can keep doing it ourselves and report it to libdwfl. The conversion replaces a bunch of code for dealing with userspace core dump notes, /proc/$pid/maps, and relocations.	2019-07-15 12:27:48 -07:00
Omar Sandoval	e73346b488	libdrgn: generalize IS_RUNNING_KERNEL flag to IS_LIVE I.e., also flag running processes as live.	2019-07-08 16:55:54 -07:00

1 2 3 4

168 Commits