JakeHillion/drgn

mirror of https://github.com/JakeHillion/drgn.git synced 2024-12-22 17:23:06 +00:00

Author	SHA1	Message	Date
Omar Sandoval	2d8aeacb30	Allow naming and configuring order of object finders This one doesn't need any changes to the callback signature, just the new interface. We also keep add_object_finder() for compatibility. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2024-06-05 13:40:26 -07:00
Omar Sandoval	5c9797a633	Allow naming and configuring order of type finders Like for symbol finders, we want extra flexibility around configuring type finders. The type finder callback signature also has a couple of warts: it doesn't take the program since it was added before types needed to be constructed from a program, and it is called separately for each type kind since originally type lookups were for only one kind. While we're adding a new interface, let's fix these warts: pass the program and a set of type kinds. However, we need to keep the old add_type_finder() interface for backwards compatibility. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2024-06-05 13:40:26 -07:00
Omar Sandoval	9f9f945a31	tests: expose _AssertIdenticalWrapper as IdenticalMatcher This is for upcoming tests using the unittest.mock.Mock.assert_called_with() family of functions, which compares with ==. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2023-11-08 13:24:11 -08:00
Omar Sandoval	60a289fdff	tests: make identical() stricter Require exact type matches, not subclasses, and fail hard for types we don't explicitly handle. This caught one place where we weren't testing what we thought we were. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2023-11-08 13:18:44 -08:00
Omar Sandoval	68fc40fe85	tests: replace setenv() with more general modifyenv() No functional change for existing tests, but it will be used for future tests. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2023-10-11 14:45:06 -07:00
Omar Sandoval	101639caaf	tests: don't compare values of reference objects in assertIdentical() The value can change between two reads. This caused test failures in the previous commit `18a8f69ad8` ("libdrgn: linux_kernel: add object finder for jiffies"). The important thing is that the addresses are the same. Fixes: `75c3679147` ("Rewrite drgn core in C") Signed-off-by: Omar Sandoval <osandov@osandov.com>	2023-02-22 13:33:47 -08:00
Omar Sandoval	87b7292aa5	Relicense drgn from GPLv3+ to LGPLv2.1+ drgn is currently licensed as GPLv3+. Part of the long term vision for drgn is that other projects can use it as a library providing programmatic interfaces for debugger functionality. A more permissive license is better suited to this goal. We decided on LGPLv2.1+ as a good balance between software freedom and permissiveness. All contributors not employed by Meta were contacted via email and consented to the license change. The only exception was the author of commit `c4fbf7e589` ("libdrgn: fix for compilation error"), who did not respond. That commit reverted a single line of code to one originally written by me in commit `640b1c011d` ("libdrgn: embed DWARF index in DWARF info cache"). Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-11-01 17:05:16 -07:00
Shung-Hsi Yu	e8d0c85811	test: add test for _repr_pretty_() method Since _repr_pretty_() uses output of str(), and the latter is already heavily tested in tests/test_language_c.py, we can simply test whether p.text() is called instead of duplicating all the test cases. Signed-off-by: Shung-Hsi Yu <shung-hsi.yu@suse.com>	2022-08-25 13:52:28 -07:00
Omar Sandoval	085e6f3078	tests: remove no-op setUp() method Commit `7d7aa7bf7b` ("libdrgn/python: remove Type == operator") removed the substantial part of tests.TestCase.setUp() but didn't remove the method. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2022-01-22 11:12:52 -08:00
Omar Sandoval	c0d8709b45	Update copyright headers to Meta Signed-off-by: Omar Sandoval <osandov@osandov.com>	2021-11-21 15:59:44 -08:00
Omar Sandoval	5541fad063	Fix some flake8 errors Mainly unused imports, unused variables, unnecessary f-strings, and regex literals missing the r prefix. I'm not adding it to the CI linter because it's too noisy, though. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2021-08-11 14:52:44 -07:00
Omar Sandoval	ffcb9ccb19	libdrgn: debug_info: implement creating objects from DWARF location descriptions Add support for evaluating a DWARF location description and translating it into a drgn object. In this commit, this is just used for global variables, but an upcoming commit will wire this up to stack traces for parameters and local variables. There are a few locations that drgn's object model can't represent yet. DW_OP_piece/DW_OP_bit_piece can describe objects that are only partially known or partially in memory; we approximate these where we can. We don't have a good way to support DW_OP_implicit_pointer at all yet. This also adds test cases for DWARF expressions, which we couldn't easily test before. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2021-06-05 16:18:51 -07:00
Omar Sandoval	a4b9d68a8c	Use GPL-3.0-or-later license identifier instead of GPL-3.0+ Apparently the latter is deprecated and the former is preferred. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2021-04-03 01:10:35 -07:00
Omar Sandoval	9fda010789	Track byte order in scalar types instead of objects Currently, reference objects and buffer value objects have a byte order. However, this doesn't always make sense for a couple of reasons: - Byte order is only meaningful for scalars. What does it mean for a struct to be big endian? A struct doesn't have a most or least significant byte; its scalar members do. - The DWARF specification allows either types or variables to have a byte order (DW_AT_endianity). The only producer I could find that uses this is GCC for the scalar_storage_order type attribute, and it only uses it for base types, not variables. GDB only seems to check it for base types, as well. So, remove the byte order from objects, and move it to integer, boolean, floating-point, and pointer types. This model makes more sense, and it means that we can get the binary representation of any object now. The only downside is that we can no longer support a bit offset for non-scalars, but as far as I can tell, nothing needs that. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2021-02-19 21:41:29 -08:00
Omar Sandoval	352c31e1ac	Add support for C++ template parameters Add struct drgn_type_template_parameter to libdrgn, the corresponding TypeTemplateParameter to the Python bindings, and support for parsing them from DWARF. With this, support for templates is almost, but not quite, complete. The main wart is that DW_AT_name of compound types includes the template parameters, so the type tag includes it as well. We should remove that from the tag and instead have the type formatting code add it only when getting the full type name. Based on a patch from Jay Kamat. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2021-01-08 17:39:51 -08:00
Omar Sandoval	d35243b354	libdrgn: replace lazy types with lazy objects In order to support static members, methods, default function arguments, and value template parameters, we need to be able to store a drgn_object in a drgn_type_member or drgn_type_parameter. These are all cases where we want lazy evaluation, so we can replace drgn_lazy_type with a new drgn_lazy_object which implements the same idea but for objects. Types can still be represented with an absent object. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2021-01-08 17:39:51 -08:00
Omar Sandoval	7d7aa7bf7b	libdrgn/python: remove Type == operator The == operator on drgn.Type is only intended for testing. It's expensive and slow and not what people usually want. It's going to get even more awkward to define once types can refer to objects (for template parameters and static members and such). Let's replace == with a new identical() function only available in unit tests. Then, remove the operator from the Python bindings as well as the underlying libdrgn drgn_type_eq() and drgn_qualified_type_eq() functions. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-12-22 03:11:38 -08:00
Omar Sandoval	36068a0ea8	Fix trailing commas for Black v20.8b1 Black was recently changed to treat a trailing comma as an indicator to put each item/argument on its own line. We have a bunch of places where something previously had to be split into multiple lines, then was edited to fit on one line, but Black kept the trailing comma. Now this update wants to unnecessarily split it back up. For now, let's get rid of these commas. Hopefully in the future Black has a way to opt out of this. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-08-27 11:31:29 -07:00
Omar Sandoval	a97f6c4fa2	Associate types with program I originally envisioned types as dumb descriptors. This mostly works for C because in C, types are fairly simple. However, even then the drgn_program_member_info() API is awkward. You should be able to look up a member directly from a type, but we need the program for caching purposes. This has also held me back from adding offsetof() or has_member() APIs. Things get even messier with C++. C++ template parameters can be objects (e.g., template <int N>). Such parameters would best be represented by a drgn object, which we need a drgn program for. Static members are a similar case. So, let's reimagine types as being owned by a program. This has a few parts: 1. In libdrgn, simple types are now created by factory functions, drgn_foo_type_create(). 2. To handle their variable length fields, compound types, enum types, and function types are constructed with a "builder" API. 3. Simple types are deduplicated. 4. The Python type factory functions are replaced by methods of the Program class. 5. While we're changing the API, the parameters to pointer_type() and array_type() are reordered to be more logical (and to allow pointer_type() to take a default size of None for the program's default pointer size). 6. Likewise, the type factory methods take qualifiers as a keyword argument only. A big part of this change is updating the tests and splitting up large test cases into smaller ones in a few places. Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-08-26 17:41:09 -07:00
Omar Sandoval	4e770fb18a	Format imports with isort Signed-off-by: Omar Sandoval <osandov@osandov.com>	2020-08-20 16:55:07 -07:00
Omar Sandoval	8b264f8823	Update copyright headers to Facebook and add missing headers drgn was originally my side project, but for a while now it's also been my work project. Update the copyright headers to reflect this, and add a copyright header to various files that were missing it.	2020-05-15 15:13:02 -07:00
Jay Kamat	6c264b0eae	libdrgn: add language to struct drgn_type For types obtained from DWARF, we determine it from the language of the CU. For other types, it can be specified manually or fall back to the default (C). Then, we can use the language for operations where the type is available.	2020-02-26 19:55:42 -08:00
Omar Sandoval	26ef465007	libdrgn/python: add proper type for members and parameters This continues the conversion from the last commit. Members and parameters are basically the same, so we can do them together. Unlike enumerators, these don't make sense to unpack or access as sequences.	2020-02-12 15:40:19 -08:00
Omar Sandoval	7c70a1a384	libdrgn/python: add proper type for enumerators Currently, type members, enumerators, and parameters are all represented by tuples in the Python bindings. This is awkward to document and implement. Instead, let's replace these tuples with proper types, starting with the easiest one, TypeEnumerator. This one still makes sense to treat as a sequence so that it can be unpacked as (name, value).	2020-02-12 15:37:41 -08:00
Omar Sandoval	660276a0b8	Format Python code with Black I'm not a fan of 100% of the Black coding style, but I've spent too much time manually formatting Python code, so let's just pull the trigger.	2020-01-14 11:51:58 -08:00
Amlan Nayak	0df2152307	Add basic class type support This implements the first step at supporting C++: class types. In particular, this adds a new drgn_type_kind, DRGN_TYPE_CLASS, and support for parsing DW_TAG_class_type from DWARF. Although classes are not valid in C, this adds support for pretty printing them, for completeness.	2019-11-18 10:36:40 -08:00
Omar Sandoval	690b5fd650	libdrgn: generalize architecture to platform For stack trace support, we'll need to have some architecture-specific functionality. drgn's current notion of an architecture doesn't actually include the instruction set architecture. This change expands it to a "platform", which includes the ISA as well as the existing flags.	2019-08-02 00:11:56 -07:00
Omar Sandoval	0c5df56fba	libdrgn: replace symbol index with object index struct drgn_symbol doesn't really represent a symbol; it's just an object which hasn't been fully initialized (see `c2be52dff0` ("libdrgn: rename object index to symbol index"), it used to be called a "partial object"). For stack traces, we're going to have a notion of a symbol that more closely represents an ELF symbol, so let's get rid of the temporary struct drgn_symbol representation and just return an object directly.	2019-07-29 17:04:47 -07:00
Omar Sandoval	b5b024ecac	tests: move common helpers to top-level	2019-07-29 17:04:47 -07:00
Omar Sandoval	baba1ff3f0	libdrgn: make program components pluggable Currently, programs can be created for three main use-cases: core dumps, the running kernel, and a running process. However, internally, the program memory, types, and symbols are pluggable. Expose that as a callback API, which makes it possible to use drgn in much more creative ways.	2019-05-10 12:41:07 -07:00
Omar Sandoval	a5fb8ed4c3	Add libelf/libdw wrappers to _drgn If libelf or libdw are statically linked or not in the usual search path, ctypes won't be able to find them. Add the required functions to the wrappers in the libdrgn bindings so that we can easily find them.	2019-04-02 14:12:11 -07:00
Omar Sandoval	ec75400cc0	type: improve type handling Add a parser for C type names and push the type logic out of the CoredumpObject class.	2018-02-24 19:37:51 -08:00
Omar Sandoval	97e604c05c	Rewrite in Cython The lldwarf/drgn.dwarf split wasn't working out too well, and moving all of drgn.dwarf into lldwarf (by rewriting it into C) would be way too much work. Instead, use Cython, which results in a parser which is just as fast but with much cleaner code overall. It also turns out lldwarf wasn't doing GC right, so the switch also fixed that.	2017-08-28 21:16:43 -07:00
Omar Sandoval	1fc2b6e89b	Initial commit I wrote all of this code a few months back and am just now getting around to committing it. The low-level DWARF parsing library is pretty solid, although it only implements a subset of DWARF so far. The CLI and higher-level interface are experimental.	2017-08-24 22:46:16 -07:00

34 Commits