object-introspection

mirror of https://github.com/JakeHillion/object-introspection.git synced 2024-11-09 21:24:14 +00:00

Author	SHA1	Message	Date
Jon Haslam	3513f9580a	TypeGraphv2: correctly handle bitfields in code generation (#495 ) Currently in TypeGraph when generating inst::Field objects in the generated source we use the `sizeof` operator to construct the static and exclusive size. As you can't use sizeof() on a bitfield this generates invalid code. This fix special cases bit fields and sets the static and exclusive size to 0 as there size will be rolled up in the parent object.	2024-04-23 14:56:09 +01:00
Jake Hillion	a4723fb2ef	tbv2: update std::variant `std::variant` is the last archetypal container missing in TreeBuilder-v2. The code for it isn't hugely complicated and relies on pack expansion. This change introduces a new field to the container specification: `scoped_extra`. This field allows you to write extra code that will be included within the TypeHandler in CodeGen. This means it will not have collisions with other containers, unlike the existing `extra` field. It's used here to write the recursive `getSizeType` function for `std::variant`. Tech debt is introduced here by comparing the container name to `std::variant` in CodeGen to conditionally generate some code. We've worked hard to remove references to containers in code and move them to `.toml` files. On balance, this is worth having to include the example of `std::variant`. It should be moved into a container spec field at some point, the design of which is still to be determined. Test plan: - Activated the OIL `std::variant` tests. - CI	2024-02-23 16:16:22 +00:00
Jon Haslam	f2afeabc3c	Handle stubbed types in NameProvider (#477 )	2024-02-05 17:25:19 +00:00
Jon Haslam	31bf9e7b59	Make KeyCapture work with nested typedefs (#473 )	2024-01-31 17:03:05 +00:00
Jake Hillion	7de35863f5	tbv2: fix Thrift isset lookups with padding Thrift isset was failing with a SEGV if the struct contained padding. This is because we indexed the `isset_indexes` data structure using our field index rather than the index of the field in Thrift. This then gave a rubbish index for any exceeding which happens if we have added padding in the middle of the struct, and this index was looked up in the bitset which can cause a SEGV. Track a new index `thriftFieldIdx` which is only incremented if we've looked up a Thrift index. Namespaced the generated Thrift structs while I was there. This isn't necessary anymore but cleans things up. Test plan: - Added a test case with lots of padding. These don't run in the CI but it passes locally. - `FILTER='OilIntegration.' make test` - no failures - `FILTER='OidIntegration.' make test` - no new failures	2024-01-19 19:16:46 +00:00
Jake Hillion	31ba8659f0	tbv2: fix pointer codegen A previous change enabled running OIL tests with specific features enabled. This highlighted that pointer code generation under TreeBuilder-v2 was very broken. This change updates pointer code generation to work and enables the skipped tests. All enabled tests need `expected_json_v2` added to them due to formatting differences. Reformatted and rewrote the basic type handler that handles primitives and pointers. Removed the reliance on `features` to decide whether to generate for TreeBuilder-v2 as the intermediate features have been removed. Pointers are treated as containers with a capacity of 1 and a length of 0 if null/a cycle and 1 if followed. This holds for void pointers where, although they aren't followed, the length is still set. There were a couple of other changes needed to enable these tests on TBv2 that aren't worth their own issues and PRs, I sneaked them in here. Extra changes: - Added `Pointer` and `Reference` to TopoSorter so they generate `NameProvider` instances. It might be worth visiting the graph differently for `NameProvider` as it requires so many instances that others generators do not. Will consider that in the future. - Follow typedefs when calculating exclusive size for a type. Closes #458. Test plan: - CI - Enabled previously disabled tests.	2024-01-18 16:22:18 +00:00
Jake Hillion	40af807d8b	tbv2: support capture-thrift-isset Support the capture-thrift-isset feature with TreeBuilder-v2. Fairly minor changes here except the type of the Enum in a template parameter now matters. We follow the previous behaviour of capturing a value for each field in a struct that has an `isset_bitset`. This value is a VarInt captured before the C++ contents of the member. It has 3 values: 0 (not set), 1 (set), and 2 (unavailable). These are handled by the processor and represented in the output as `false`, `true`, and `std::nullopt_t` respectively. Changes: - Add a simple Thrift isset processor before any fields that have Thrift isset. - Store the fully qualified names of enum types in DrgnParser - it already worked out this information anyway for naming the values and this is consistent with classes. - Forward all enum template parameters under their input name under the assumption that they will all be policy type things like `IssetBitsetOption`. This could turn out to be wrong. Test plan: - CI (doesn't test thrift changes but covers other regressions) - Updated Thrift enum tests for new format. - `FILTER='OilIntegration.*' make test` - Thrift tests failed before, succeed after.	2024-01-16 19:09:46 +00:00
Jake Hillion	4c047b5f91	tbv2: add is_primitive to output C++ has a concept of Primitive which holds in the type graph. However we don't currently expose this information to the end user. Expose this from the OIL iterator to allow future features like primitive rollups. This affects containers like maps which have a fake `[]` element with no type. They use this to group together the key/value in a map and to account for any per element storage overhead. Currently the decision is to make the fake `[]` element a primitive if all of its children are primitives. This allows for more effective primitive rollups if that is implemented. This implementation detail may be changed in future. Test Plan: - CI - Updated simple tests.	2024-01-16 11:14:13 +00:00
Jake Hillion	db93feb180	incomplete: name type in compiler errors Summary: We have a good type representation in the Type Graph of an incomplete type and the underlying type that represents. However, this incomplete type still ends up in the generated code as `void` which loses information. For example, a container that can't contain void may fail to compile because it was initialised with `void` but really its because the type it was supposed to be initialised with (say, `Foo`) had incomplete debug information. This change identifies that a type is incomplete in the output by generating it as an incomplete type `struct Incomplete<struct Foo>`. This allows us to name the type correctly in the TreeBuilder output and filter for incomplete types, as well as getting appropriate compiler errors if it mustn't be incomplete. Test Plan: - CI - Added a unit test to namegen. - Enabled and added an extra pointers_incomplete test. This change is tricky to test because it isn't really user visible. The types still use their `inputName` which is unchanged in any successful output - this change is used so the compiler fails with a more detailed error.	2024-01-09 15:08:25 +00:00
Jake Hillion	6d898bed95	tbv2: account for duplicate types when emitting name providers ClangTypeParser has emitted a duplicate type for `std::allocatr<int8_t>`. Rather than fixing this, add the same check the compiler will do for the duplicate templates that `addNames` emits. That is, `template<> NameProvider<Foo>` will collide if `Foo` is used twice. We can do this by adding a set of these strings for now. If this shows up regularly it will likely make sense to deduplicate the type graph with a deduplication pass. Test plan: - Fixes the issue in prod. This change is quite logical.	2023-12-20 16:14:48 +00:00
Jake Hillion	55989a9156	oilgen: migrate to source parsing (#421 ) Summary: oilgen: migrate to source parsing Using debug information generated from partial source (that is, not the final binary) has been insufficient to generally generate OIL code. A particular example is pointers to templates: ```cpp #include <oi/oi.h> template <typename T> struct Foo { T t; }; template <typename T> struct Bar { Foo<T>& f; }; void foo(const Bar<int>& b) { oi::introspect(b); } ``` The pointer/reference to `Foo<int>` appears in DWARF with `DW_AT_declaration(true)` because it could be specialised before its usage. However, with OIL, we are creating an implicit usage site in the `oi::introspect` call that the compiler is unable to see. This change reworks OILGen to work from a Clang command line rather than debug information. We setup and run a compiler on the source, giving us access to an AST and Semantic Analyser. We then: - Find the `oi::introspect` template. - Iterate through each of its callsites for their type. - Run `ClangTypeParser::parse` on each type. - Run codegen. - Compile into an object file. Having access to the semantic analyser allows us to forcefully complete a type, as it would be if it was used in the initial code. Test Plan: hope `buck2 run fbcode//mode/opt fbcode//object-introspection/oil/examples/compile-time:compile-time` Reviewed By: tyroguru Differential Revision: D51854477 Pulled By: JakeHillion	2023-12-19 13:26:25 -08:00
Jake Hillion	37b89d789d	codegen: remove reliance on drgn type for top level name Currently we rely on `SymbolService::getTypeName` for getting the hash that's included in the generated function's name. The value of this must stay the same to match with the value expected by OIDebugger - changing it causes failure to relocate when attaching with OID and JIT OIL. Calculate this name in the `codegenFromDrgn` method and pass it through where appropriate rather than passing the `drgn_type` itself through. We don't need to name the type like that when using AoT OIL. Let's hash the linkage name instead as that is more unique. Test Plan: - CI	2023-12-19 15:35:59 +00:00
Alastair Robertson	2060a0491e	CodeGen v2: Enable independent running without CodeGen v1 Create DrgnExporter to translate Type Graph "Type" nodes into drgn_type structs, suitable for use in OICache and TreeBuilder.	2023-12-15 14:57:24 +00:00
Alastair Robertson	688d483c0c	TypeGraph: Fix handling for classes which inherit from containers We previously moved container identification later in CodeGen in order to preserve information for AlignmentCalc. However, Flattener needs to know if a class is a container in order to apply its special handling for this case. This new approach moves container identification in front of Flattener, but has Container own a type node, representing its layout. This underlying type node can be used for calculating a container's alignment in a later pass.	2023-12-14 18:02:45 +00:00
Jake Hillion	b117150f83	tbv2: add dynamic context passed through all functions (#410 ) Summary: tbv2: add dynamic context passed through all functions Previously for we had some shared state between all requests, noticeably the pointers set. This change adds a by reference value to all requests which can hold additional mutable state. The pointers set is moved into this mutable state for OIL, which means each concurrent request will have its own pointer set. Doing things this way allows more features to be added in the future without such a big code modification. Closes https://github.com/facebookexperimental/object-introspection/issues/404 Pull Request resolved: https://github.com/facebookexperimental/object-introspection/pull/410 Test Plan: - CI Differential Revision: D51394035 Pulled By: JakeHillion fbshipit-source-id: 55d2ba9b5e056148a29dc821020cfc3d94e5175a	2023-11-16 08:03:32 -08:00
Jake Hillion	592e182e0f	tbv2: replace DB template param with Ctx (#409 ) Summary: tbv2: replace DB template param with Ctx TreeBuilder v2 adds a DB template parameter to every function. This is used as part of the static type to decide what type of DataBuffer is being used: currently `BackInserterDataBuffer<std::vector<uint8_t>>` for OIL and it would be `DataSegmentDataBuffer` for OID. This change replaces the `DB` template parameter with a more general `Ctx`. Due to issues with dependent naming it also adds a `using DB` to each `TypeHandler` which has the same function as before. This allows us to add more "static context" (typedefs and constants) to functions without changing this signature again, because changing the signature of everything is a massive pain. Currently this change achieves nothing because Ctx contains only DB in a static wrapper. In the next change I'm going to pass a reference of type Ctx around to add a "dynamic context" to invocations which will contain the pointer array. In future we'll then be able to add either static or dynamic context without any signature adjustments. Test Plan: - CI --- Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/facebookexperimental/object-introspection/pull/409). * https://github.com/facebookexperimental/object-introspection/issues/410 * __->__ https://github.com/facebookexperimental/object-introspection/issues/409 Reviewed By: ajor Differential Revision: D51352092 Pulled By: JakeHillion	2023-11-15 11:52:17 -08:00
Jake Hillion	3871d92abb	collapse TreeBuilderV2 features Summary: Currently there are two features between CodeGen v2 (TypeGraph) and TreeBuilder v2. These are TypedDataSegment and TreeBuilderTypeChecking. Each of these features currently has a full set of tests run in the CI and each have specific exclusions. Collapse these features into TreeBuilder v2. This allows for significantly simplified testing as any OIL tests run under TreeBuilder v2 and any OID tests run under TreeBuilder v1. The reasoning behind this is I no longer intend to partially roll out this feature. Full TreeBuilder v2 applies different conditions to containers than the intermediate states, and writing these only to have them never deployed is a waste of time. Test Plan: - it builds - CI	2023-11-13 19:43:03 +00:00
Jake Hillion	393f8aab42	clang-format: disable bin packing Bin packing often makes code hard to read. Disable it entirely. Test plan: - CI	2023-11-13 18:19:53 +00:00
Alastair Robertson	c207972af6	TypeGraph: Calculate alignment before identifying containers Not all containers have 8-byte alignment, so if we want to avoid lots of manual logic for calculating container alignment on a case-by-case basis, we must calculate alignment from the member variables before the Class nodes have been replaced by Container nodes.	2023-11-06 13:16:30 +00:00
Alastair Robertson	2c5fb5d845	TypeGraph: Stop identifying containers in DrgnParser Leave it to the new mutator pass IdentifyContainers to replace Class nodes with Container nodes where appropriate. This will allow us to run passes over the type graph before identifying containers, and therefore before we have lost information about the internal details of the container (e.g. alignment of member variables).	2023-11-06 11:45:57 +00:00
Jake Hillion	6e1635ce1e	remove oil v1 leftovers	2023-10-30 18:18:13 +00:00
Jake Hillion	f7bb1e75ad	tbv2: fix exclusive size of elements in containers	2023-10-16 19:18:42 +01:00
Jake Hillion	4c6f232766	containers: add required features (#374 ) Summary: Adds the option for required features to container definitions. These cause the container not to be passed to `DrgnParser` if that feature is not enabled during code generation. The thrift isset type does not currently work with `tree-builder-v2` and only provides benefit with `capture-thrift-isset`. This change makes sure the container is ignored if it won't be useful, allowing code generation under `tree-builder-v2`. Test Plan: - CI Differential Revision: D49960512 Pulled By: JakeHillion	2023-10-09 17:50:39 -04:00
Thierry Treyer	7eee6e0871	Add container_type alias in container TypeHandler	2023-10-09 10:41:34 -06:00
Alastair Robertson	4afa2ff190	CodeGen: Add support for key capture	2023-09-28 19:42:49 -07:00
Jake Hillion	7a7a9b347a	jit logging: fix for codegen v1	2023-09-26 18:41:27 -07:00
Alastair Robertson	bd826f9794	CodeGen: Store list of ContainerInfos in unique_ptrs for reference stability Lots of places rely on reference stability of ContainerInfo objects (CodeGen's deduplication, Container nodes' containerInfo_ member). In the key capture work, we need to be able to append to this list, which would invalidate references before this change.	2023-09-22 17:10:56 +01:00
Jake Hillion	c766d7b572	codegen: generate enums as enum class	2023-09-20 18:13:45 +01:00
Jake Hillion	8dd1182323	static types: add consume function similar to delegate	2023-09-15 14:43:09 +01:00
Jake Hillion	d71307cb43	oil: change std::stack reference to a std::function (#345 ) Summary: Previously on large types OIL would have problems with corrupting the `std::stack<exporter::inst::Inst>` that is passed to the processors. This change hides the implementation of the stack from the processors by wrapping the call to emplace in a `std::function` written by the non-generated code, which solves the test case I've seen for this crashing. It also allows us to easily change the stack implementation in future - I plan to change it to a `std::stack<T, std::vector<T>>` in a follow up. Reviewed By: tyroguru Differential Revision: D49273116	2023-09-14 16:57:45 +01:00
Jake Hillion	a9cffbe22d	Compile Time OIL for OIL v2 Summary: Update `OIGenerator` and out BUCK stuff for compile time OIL with OIL v2. Main things: - Switch `OIGenerator` from the `getObjectSize` call to the new `introspect` call. - Switch from looking at template parameters to looking at function parameters, as this was exposing a bug in our elfutils/drgn and this way it's the same as OID. - Migrate `OIGenerator` to CodeGen v2 and update CodeGen v2 to accept a linkage name. - Update the compile time example to be the same as the JIT example, using the new interface and the JSON exporter. - Clean up the `ObjectIntrospection.h` header. Differential Revision: D48687728 fbshipit-source-id: 2c3c041fd1b6499c5e02eb5e2082a977bfa529d7	2023-08-30 04:28:02 -07:00
Jake Hillion	6b90401f51	tbv2: add make_field helper	2023-08-29 17:41:10 +01:00
Jake Hillion	d009f02ecb	name contained types properly in treebuilder v2 Types within containers were previously named TODO. This sorts it out so they're named as their most resolved type. The current implementation skips Typedef names.	2023-08-25 16:25:14 +01:00
Jake Hillion	54c8d79996	oil: fix verbosity level in jit oil	2023-08-24 17:39:12 +01:00
Jake Hillion	5071519e45	oil v2	2023-08-23 15:59:53 +01:00
Alastair Robertson	373dbe8f6c	TypeGraph: Add option to record enumerator names	2023-08-17 15:47:23 +01:00
Jake Hillion	063646a1d9	codegen: split genClassTypeHandler	2023-08-16 14:40:07 +01:00
Alastair Robertson	ed2c6f357d	CodeGen: Apply "alignas" to all stubbed types Stubbing types listed in OICodeGen's "typesToStub" leaves them in the same state as stubbed unions, so they need the same handling.	2023-08-01 17:53:39 +01:00
Jake Hillion	d7cfcca30d	move all internal logic to oi::detail namespace	2023-07-26 18:01:38 +01:00
Alastair Robertson	8b7bfbe4c0	TypeGraph: Add EnforceCompatibility pass This extracts the compatibility logic from AddPadding, which allows for it to be simplified and will make it easier to extend and eventually remove in the future. No functional changes.	2023-07-26 17:12:36 +01:00
Alastair Robertson	45c3697f6b	TypeGraph: Add Prune pass This lets us remove fields from types when they are no longer needed, speeding up later passes. A secondary benefit of pruning unused types means that we sometimes remove types for which we can't generate correct C++ code. This can allow us to CodeGen for complex types which reference these broken types without actually requiring them (e.g. as template parameters). Add a new feature flag "prune-type-graph" to control this pass. It makes sense to prune most of the time, but for testing CodeGen functionality on a wider range of types, it will be useful to have the option to not prune.	2023-07-26 16:56:34 +01:00
Alastair Robertson	884b9a6e95	CodeGen: Don't measure the sizes of union members In general, we can't tell which member is active in a union so it is not safe to try and measure any of them. Explicitly set the alignment of unions (and structs/classes) in CodeGen if it is available, as the C++ compiler can no longer infer it from the members.	2023-07-26 11:16:30 +01:00
Alastair Robertson	2d1cc92bb4	Rename RemoveIgnored -> RemoveMembers Also reshuffle CodeGen's passes to fix an alignment bug with removed members. Change RemoveMembers to actually remove members instead of replacing them with padding. AddPadding must be run afterwards to fill in the gaps.	2023-07-26 10:53:04 +01:00
Jake Hillion	623f896e9e	increase verbosity of size static asserts	2023-07-24 18:48:59 +01:00
Alastair Robertson	3b6b739d55	CodeGen: Add comments explaining the order of passes	2023-07-18 17:20:59 +01:00
Jake Hillion	31f46831c2	typing: handle padding only structs	2023-07-18 17:17:32 +01:00
Jake Hillion	bd948152b7	add exporters::TypeCheckingWalker	2023-07-13 16:05:24 +01:00
Jake Hillion	032c28c0ea	type checking: add description of data segment type	2023-07-13 16:05:24 +01:00
Jake Hillion	02defdb859	comment existing typed data segment work	2023-07-10 21:09:43 +01:00
Alastair Robertson	3ec81aaa5f	TypeGraph: Add "--tree-builder-v2" flag This will eventually be used to enable running with Tree Builder v2. For now, when it is disabled it puts CodeGen v2 into compatibility mode, disabling features which weren't present in CodeGen v1 so that its output can be understood by Tree Builder v1.	2023-07-06 17:43:09 +01:00

1 2

65 Commits