mirror of
				https://github.com/isledecomp/isle.git
				synced 2025-10-21 23:44:15 +00:00 
			
		
		
		
	Remove reccmp README duplication (#1563)
Co-authored-by: jonschz <jonschz@users.noreply.github.com>
This commit is contained in:
		| @@ -15,8 +15,6 @@ To access the Ghidra repository, use the following details: | ||||
| - Address: `server.mattkc.com` | ||||
| - Port: `13100` | ||||
|  | ||||
| **Please note that at the time of writing, much of the information found on the Ghidra server is severely outdated**. Generally, the source code found in this repository represents the latest "source of truth" and should be referenced whenever possible. | ||||
|  | ||||
| ## General Guidelines | ||||
|  | ||||
| If you feel fit to contribute, feel free to create a pull request! Someone will review and merge it (or provide feedback) as soon as possible. | ||||
|   | ||||
							
								
								
									
										199
									
								
								tools/README.md
									
									
									
									
									
								
							
							
						
						
									
										199
									
								
								tools/README.md
									
									
									
									
									
								
							| @@ -2,202 +2,15 @@ | ||||
|  | ||||
| Accuracy to the game's original code is the main goal of this project. To facilitate the decompilation effort and maintain overall quality, we have devised a set of annotations, to be embedded in the source code, which allow us to automatically verify the accuracy of re-compiled functions' assembly, virtual tables, variable offsets and more. | ||||
|  | ||||
| In order for contributions to be accepted, the annotations must be used in accordance to the rules outlined here. Proper use is enforced by [GitHub Actions](/.github/workflows) which run the Python tools found in this folder. It is recommended to integrate these tools into your local development workflow as well. | ||||
| The tooling we have developed has been moved to the [reccmp](https://github.com/isledecomp/reccmp) repo to facilitate its use in other decompilation projects. | ||||
| * See the [README](https://github.com/isledecomp/reccmp?tab=readme-ov-file#getting-started) on how to get started. | ||||
| * Familiarize yourself with the available [annotations](https://github.com/isledecomp/reccmp/blob/master/docs/annotations.md) and the [best practices](https://github.com/isledecomp/reccmp/blob/master/docs/recommendations.md) we have established. | ||||
|  | ||||
| # Overview | ||||
|  | ||||
| We are continually working on extending the capabilities of our "decompilation language" and the toolset around it. Some of the following annotations have not made it into formal verification and thus are not technically enforced on the source code level yet (marked as **WIP**). Nevertheless, it is recommended to use them since it is highly likely they will eventually be fully integrated. | ||||
|  | ||||
| ## Functions | ||||
|  | ||||
| All non-inlined functions in the code base with the exception of [3rd party code](/3rdparty) must be annotated with one of the following markers, which include the module name and address of the function as found in the original binaries. This information is then used to compare the recompiled assembly with the original assembly, resulting in an accuracy score. Functions in a given compilation unit must be ordered by their address in ascending order. | ||||
|  | ||||
| The annotations can be attached to the function implementation, which is the most common case, or use the "comment" syntax (see examples below) for functions that cannot be referred to directly (such as templated, synthetic or non-inlined inline functions). The latter should only ever appear in `.h` files. | ||||
|  | ||||
| ### `FUNCTION` | ||||
|  | ||||
| Functions with a reasonably complete implementation which are not templated or synthetic (see below) should be annotated with `FUNCTION`. | ||||
|  | ||||
| ``` | ||||
| // FUNCTION: LEGO1 0x100b12c0 | ||||
| MxCore* MxObjectFactory::Create(const char* p_name) | ||||
| { | ||||
|   // implementation | ||||
| } | ||||
|  | ||||
| // FUNCTION: LEGO1 0x100140d0 | ||||
| // MxCore::IsA | ||||
| ``` | ||||
|  | ||||
| ### `STUB` | ||||
|  | ||||
| Functions with no or a very incomplete implementation should be annotated with `STUB`. These will not be compared to the original assembly. | ||||
|  | ||||
| ``` | ||||
| // STUB: LEGO1 0x10011d50 | ||||
| LegoCameraController::LegoCameraController() | ||||
| { | ||||
|   // TODO | ||||
| } | ||||
| ``` | ||||
|  | ||||
| ### `TEMPLATE` | ||||
|  | ||||
| Templated functions should be annotated with `TEMPLATE`. Since the goal is to eventually have a full accounting of all the functions present in the binaries, please make an effort to find and annotate every function of a templated class. | ||||
|  | ||||
| ``` | ||||
| // TEMPLATE: LEGO1 0x100c0ee0 | ||||
| // list<MxNextActionDataStart *,allocator<MxNextActionDataStart *> >::_Buynode | ||||
|  | ||||
| // TEMPLATE: LEGO1 0x100c0fc0 | ||||
| // MxStreamListMxDSSubscriber::~MxStreamListMxDSSubscriber | ||||
|  | ||||
| // TEMPLATE: LEGO1 0x100c1010 | ||||
| // MxStreamListMxDSAction::~MxStreamListMxDSAction | ||||
| ``` | ||||
|  | ||||
| ### `SYNTHETIC` | ||||
|  | ||||
| Synthetic functions should be annotated with `SYNTHETIC`. A synthetic function is generated by the compiler; most common is the "scalar deleting destructor" found in virtual tables. Other cases include default destructors and assignment operators. Note: `SYNTHETIC` takes precedence over `TEMPLATE`. | ||||
|  | ||||
| ``` | ||||
| // SYNTHETIC: LEGO1 0x10003210 | ||||
| // Helicopter::`scalar deleting destructor' | ||||
|  | ||||
| // SYNTHETIC: LEGO1 0x100c4f50 | ||||
| // MxCollection<MxRegionLeftRight *>::`scalar deleting destructor' | ||||
|  | ||||
| // SYNTHETIC: LEGO1 0x100c4fc0 | ||||
| // MxList<MxRegionLeftRight *>::`scalar deleting destructor' | ||||
| ``` | ||||
|  | ||||
| ### `LIBRARY` | ||||
|  | ||||
| Functions located in 3rd party libraries should be annotated with `LIBRARY`. Since the goal is to eventually have a full accounting of all the functions present in the binaries, please make an effort to find and annotate every function of every statically linked library, including the MSVC standard libraries. | ||||
|  | ||||
| ``` | ||||
| // LIBRARY: ISLE 0x4061b0 | ||||
| // _MemPoolInit@4 | ||||
|  | ||||
| // LIBRARY: ISLE 0x406520 | ||||
| // _MemPoolSetPageSize@8 | ||||
|  | ||||
| // LIBRARY: ISLE 0x406630 | ||||
| // _MemPoolSetBlockSizeFS@8 | ||||
| ``` | ||||
|  | ||||
| ## Virtual tables | ||||
|  | ||||
| Classes with a virtual table should be annotated using the `VTABLE` marker, which includes the module name and address of the virtual table. Additionally, virtual function declarations should be annotated with a comment indicating their relative offset. Please use the following example as a reference. | ||||
|  | ||||
| ``` | ||||
| // VTABLE: LEGO1 0x100dc900 | ||||
| class MxEventManager : public MxMediaManager { | ||||
| public: | ||||
| 	MxEventManager(); | ||||
| 	virtual ~MxEventManager() override; | ||||
|  | ||||
| 	virtual void Destroy() override;                                     // vtable+0x18 | ||||
| 	virtual MxResult Create(MxU32 p_frequencyMS, MxBool p_createThread); // vtable+0x28 | ||||
| ``` | ||||
|  | ||||
| ## Class size | ||||
|  | ||||
| Classes should be annotated using the `SIZE` marker to indicate their size. If you are unsure about the class size in the original binary, please use the currently available information (known member variables) and detail the circumstances in an extra comment if necessary. | ||||
|  | ||||
| ``` | ||||
| // SIZE 0x1c | ||||
| class MxCriticalSection { | ||||
| public: | ||||
| 	MxCriticalSection(); | ||||
| 	~MxCriticalSection(); | ||||
| 	static void SetDoMutex(); | ||||
| ``` | ||||
|  | ||||
| Furthermore, add `DECOMP_SIZE_ASSERT(MxCriticalSection, 0x1c)` to the respective `.cpp` file (if the class has no dedicated `.cpp` file, use any appropriate `.cpp` file where the class is used). | ||||
|  | ||||
| ## Member variables | ||||
|  | ||||
| Member variables should be annotated with their relative offsets. | ||||
|  | ||||
| ``` | ||||
| class MxDSObject : public MxCore { | ||||
| private: | ||||
| 	MxU32 m_sizeOnDisk;   // 0x8 | ||||
| 	MxU16 m_type;         // 0xc | ||||
| 	char* m_sourceName;   // 0x10 | ||||
| 	undefined4 m_unk0x14; // 0x14 | ||||
| ``` | ||||
|  | ||||
| ## Global variables | ||||
|  | ||||
| Global variables should be annotated using the `GLOBAL` marker, which includes the module name and address of the variable. | ||||
|  | ||||
| ``` | ||||
| // GLOBAL: LEGO1 0x100f456c | ||||
| MxAtomId* g_jukeboxScript = NULL; | ||||
|  | ||||
| // GLOBAL: LEGO1 0x100f4570 | ||||
| MxAtomId* g_pz5Script = NULL; | ||||
|  | ||||
| // GLOBAL: LEGO1 0x100f4574 | ||||
| MxAtomId* g_introScript = NULL; | ||||
| ``` | ||||
|  | ||||
| ## Strings | ||||
|  | ||||
| String values should be annotated using the `STRING` marker, which includes the module name and address of the string. | ||||
|  | ||||
| ``` | ||||
| inline virtual const char* ClassName() const override // vtable+0x0c | ||||
| { | ||||
| 	// STRING: LEGO1 0x100f03fc | ||||
| 	return "Act2PoliceStation"; | ||||
| } | ||||
| ``` | ||||
|  | ||||
| # Tooling | ||||
|  | ||||
| Use `pip` to install the required packages to be able to use the Python tools found in this folder: | ||||
|  | ||||
| ```sh | ||||
| pip install -r tools/requirements.txt | ||||
| ``` | ||||
|  | ||||
| Run the following command to allow reccmp to detect the original LEGO binaries: | ||||
|  | ||||
| ```sh | ||||
| reccmp-project detect --what original --search-path <paths-to-directories0containing-lego-binaries> | ||||
| ``` | ||||
|  | ||||
| After building recompiled binaries, run the following command in this repository's root: | ||||
|  | ||||
| ```sh | ||||
| reccmp-project detect --what recompiled --search-path <paths-to-build-directories> | ||||
| ``` | ||||
|  | ||||
| The example usages below assume that the current working directory is this repository's root and that the retail binaries have been copied to `./legobin`. | ||||
|  | ||||
| * `reccmp-decomplint`: Checks the decompilation annotations (see above) | ||||
|     * e.g. `reccmp-decomplint --module LEGO1 LEGO1` | ||||
| * [`ncc`](/tools/ncc): Checks naming conventions based on a set of rules | ||||
| * `reccmp-reccmp`: Compares an original binary with a recompiled binary, provided a PDB file. For example: | ||||
|     * Display the diff for a single function: `py -m tools.reccmp.reccmp --verbose 0x100ae1a0 legobin/LEGO1.DLL build/LEGO1.DLL build/LEGO1.PDB .` | ||||
|     * Generate an HTML report: `py -m tools.reccmp.reccmp --html output.html legobin/LEGO1.DLL build/LEGO1.DLL build/LEGO1.PDB .` | ||||
|     * Create a base file for diffs: `py -m tools.reccmp.reccmp --json base.json --silent legobin/LEGO1.DLL build/LEGO1.DLL build/LEGO1.PDB .` | ||||
|     * Diff against a base file: `py -m tools.reccmp.reccmp --diff base.json legobin/LEGO1.DLL build/LEGO1.DLL build/LEGO1.PDB .` | ||||
| * `reccmp-stackcmp`: Compares the stack layout for a given function that almost matches. | ||||
|     * e.g. `reccmp-stackcmp legobin/BETA10.DLL build_debug/LEGO1.DLL build_debug/LEGO1.pdb . 0x1007165d` | ||||
| * `reccmp-roadmap`: Compares symbol locations in an original binary with the same symbol locations of a recompiled binary | ||||
| * `reccmp-verexp`: Verifies exports by comparing the exports of the original DLL and the recompiled DLL | ||||
| * `reccmp-vtable`: Asserts virtual table correctness by comparing a recompiled binary with the original | ||||
|     * e.g. `reccmp-vtable legobin/LEGO1.DLL build/LEGO1.DLL build/LEGO1.PDB .` | ||||
| * `reccmp-datacmp`: Compares global data found in the original with the recompiled version | ||||
|     * e.g. `reccmp-datacmp legobin/LEGO1.DLL build/LEGO1.DLL build/LEGO1.PDB .` | ||||
| * [`patch_c2.py`](/tools/patch_c2.py): Patches `C2.EXE` (part of MSVC 4.20) to get rid of a bugged warning | ||||
| The following scripts are specific to LEGO Island and have thus remained here: | ||||
| * [`patch_c2.py`](/tools/patch_c2.py): Patches `C2.EXE` (part of MSVC 4.20) to get rid of a bugged warning. | ||||
|  | ||||
| # Modules | ||||
| The following is a list of all the modules found in the annotations (e.g. `// FUNCTION: [module] [address]`) and which binaries they refer to. See [this list of all known versions of the game](https://www.legoisland.org/wiki/LEGO_Island#Download). | ||||
| The following is a list of all the modules found in the annotations (e.g. `// FUNCTION: [module] [address]`) and which binaries they refer to. See also [this list of all known versions of the game](https://www.legoisland.org/wiki/LEGO_Island#Download). | ||||
|  | ||||
| ## Retail v1.1.0.0 (v1.1) | ||||
| * `LEGO1` -> `LEGO1.DLL` | ||||
|   | ||||
		Reference in New Issue
	
	Block a user
	 jonschz
					jonschz