Reviving the language that brought us the Jak & Daxter Series
Go to file
water111 b56025412b
Recognize auto-generated inspect methods and create deftypes from them (#95)
- Recognize new type definitions/parents/type flags in the decompiler
- Analyze autogenerated inspect methods and dump guesses at fields to a file
- Utility functions for accessing static data by label
- Better ordering in the decompiler to go through functions in the order they appeared in the source
- Added a decent number of types to `all-types.gc` based on the new field analyzer
- Correct a few `int`/`integer` mistakes in `gcommon.gc` (this should really be a warning)
- Correct a few type issues in `gcommon` and `gkernel-h`
- Option in the decompiler to be strict about `define-extern` redefining a type of a symbol
- Add a test to check consistency in types between `all-types.gc` (used by decompiler) and `goal_src` (used by the compiler)
2020-10-24 22:51:40 -04:00
.github Disable ccache for now, causing coverage problems 2020-10-09 21:18:05 -04:00
.vs Implement logging features for decompiler (#78) 2020-10-15 20:59:30 -04:00
asset_tool check in existing work 2020-08-22 22:30:12 -04:00
cmake/modules Add Coveralls.io 2020-09-08 15:47:25 -04:00
common Recognize auto-generated inspect methods and create deftypes from them (#95) 2020-10-24 22:51:40 -04:00
data check in existing work 2020-08-22 22:30:12 -04:00
decompiler Recognize auto-generated inspect methods and create deftypes from them (#95) 2020-10-24 22:51:40 -04:00
decompiler_out add decompiler 2020-08-22 23:30:17 -04:00
doc Add features for gkernel 2 (#89) 2020-10-17 11:18:58 -04:00
game Tweak logger settings to make logging happen in order (#73) 2020-10-11 19:55:29 -04:00
goal_src Recognize auto-generated inspect methods and create deftypes from them (#95) 2020-10-24 22:51:40 -04:00
goalc Recognize auto-generated inspect methods and create deftypes from them (#95) 2020-10-24 22:51:40 -04:00
iso_data check in existing work 2020-08-22 22:30:12 -04:00
out check in existing work 2020-08-22 22:30:12 -04:00
resources check in existing work 2020-08-22 22:30:12 -04:00
scripts Script to add #pragma once headers 2020-09-13 21:32:40 -04:00
test Recognize auto-generated inspect methods and create deftypes from them (#95) 2020-10-24 22:51:40 -04:00
third-party clean up some warnings and add some extra integer tests 2020-10-09 21:28:41 -04:00
.clang-format check in existing work 2020-08-22 22:30:12 -04:00
.gitattributes Ignore third-party folder in PRs, add syntax highlighting for GOAL files 2020-09-12 23:52:26 -04:00
.gitignore Implement logging features for decompiler (#78) 2020-10-15 20:59:30 -04:00
.gitmodules spdlog implementation (#58) 2020-10-03 09:39:04 -04:00
boot_kernel.sh Compiler Cleanup (Part 2) (#56) 2020-09-25 21:11:27 -04:00
CMakeLists.txt Merge branch 'doctashay-logging2' 2020-10-11 19:12:14 -04:00
CMakeSettings.json Implement logging features for decompiler (#78) 2020-10-15 20:59:30 -04:00
decomp.sh Add decompiler IR, basic operations, all-types file (#57) 2020-09-29 20:24:15 -04:00
gc.sh Move duplicated utilities to the common util folder and remove NEXT_DIR (#29) 2020-09-10 20:03:31 -04:00
gk.sh Tweak logger settings to make logging happen in order (#73) 2020-10-11 19:55:29 -04:00
README.md Update README with new badges 2020-10-09 21:38:13 -04:00
test_code_coverage.sh Move duplicated utilities to the common util folder and remove NEXT_DIR (#29) 2020-09-10 20:03:31 -04:00
test-cov.sh Move duplicated utilities to the common util folder and remove NEXT_DIR (#29) 2020-09-10 20:03:31 -04:00
test.sh Move duplicated utilities to the common util folder and remove NEXT_DIR (#29) 2020-09-10 20:03:31 -04:00

Jak Project

Linux Windows Coverage Status

Table of Contents

Requirements

  • cmake for build system
  • clang-format for formatting code (there is already a .clang-format provided)
  • gtest for testing. (Run git submodule update --init --recursive to check out the repository)
  • nasm for assembling x86
  • Third party libraries (nlohmann/json, minilzo, and linenoise) are provided in the third-party folder

Getting Started - Linux (Ubuntu)

Install Packages and Init Repository

sudo apt install gcc make cmake build-essential g++ nasm clang-format
git submodule update --init --recursive

Compile

mkdir build && cd build && cmake .. && make -j

Run Tests

./test.sh

Getting Started - Windows

Install Visual Studio 2019 and get the C++ and CMake tools via the Visual Studio Installer

On Windows, it's recommended to get Scoop to use as a package manager, making the follow steps much easier. Follow the steps on the bottom of the homepage here https://scoop.sh/

Once Scoop is installed, run the following command:

scoop install llvm nasm

Initialize the repository's third-party dependencies:

git submodule update --init --recursive

Open the project as a CMake project, browse for the root level CMakeLists.txt:

In the toolbar, you should be able to select an individual component to compile, or combine within the root CMakeLists.txt. In the future we will pre-define configurations to make this easier.

You may also wish to view the files that pertain to each CMake target, rather than the project as it is normally:

TODO

  • more steps to follow as we actually figure it out!
  • running tests
  • etc

Project Description

This project is to port Jak 1 (NTSC, "black label" version) to PC. The strategy is to:

  • recompile for x86 to get much better performance than emulation
  • create human-reabable GOAL source code that can be modified
  • create a GOAL compiler for x86-64 which supports live patching of code like the original
  • attempt to match the original game as close as possible (for no reason other than it's fun)
  • unpack assets in a format that can be modified

There are 6 components to this project

  • GOAL decompiler. The result will be manually cleaned up for running on a PC.
  • GOAL compiler for x86-64.
  • Game source code, made from cleaning up the result of the GOAL decompiler.
  • GOAL runtime. This will replace the parts of the game written in C++
  • Asset extraction tool to extract the models/textures/large data from the game
  • Asset packing tool.

The process to build the port will be

  • Build data extraction tool, GOAL compiler, and GOAL runtime library (all written in C++)
  • Run the GOAL compiler on the game source code to build the game engine
  • Run asset extraction on the game disc to get level data, textures, geometry data, music...
  • Run the asset packing tool to combine the unpacked assets with the compiled game engine to create the game!

Some statistics:

  • Estimated ~500k lines of GOAL code
  • 10410 functions
  • 5451 functions with no control flow (no branching, loops, if/else, short-circuiting boolean operators, gotos, etc)

The rough timeline is to finish sometime in 2022. If it looks like this is impossible, the project will be abandoned. But I have already spent about 4 months preparing to start this and seems doable. I also have some background in compilers, and familiarity with PS2 (worked on DobieStation PS2 emulator) / MIPS in general (wrote a PS1 emulator). I think the trick will be making good automated tools - the approach taken for SM64 and other N64 decompilations is way too labor-intensive to work.

Project Layout

  • goalc is the GOAL compiler
  • decompiler is the decompiler
  • decompiler_out is the decompiler output
  • data will contain big assets and the output of the GOAL compiler (not checked in to git)
  • out will contain the finished game (not checked into git)
  • resources will contain data which is checked into git
  • game will contain the game source code (C/C++)
  • goal_src will contain GOAL source code for the game
  • common will contain all data/type shared between different applications.
  • doc will contain documentation
  • iso_data is where the files from the DVD go
  • third-party will contain code we didn't write. Google Test is a git submodule in this folder.
  • tests will contain all tests
  • asset_tool will contain the asset packer/unpacker

Design

(if anybody has better ideas, feel free to suggest improvements! This is just a rough plan for now)

  • All C++ code should build from the top-level cmake.
  • All C++ applications (GOAL compiler, asset extractor, asset packer, runtime, test) should have a script in the top level which launches them.
  • All file paths should be relative to the jak folder.
  • The planned workflow for building a game:
    • git submodule update --init --recursive : check out gtest
    • mkdir build; cd build : create build folder for C++
    • cmake ..; make -j : build C++ code
    • cd ..
    • ./test.sh : run gtests
    • ./asset_extractor.sh ./iso_data : extract assets from game
    • ./build_engine.sh : run GOAL compiler to build all game code
    • ./build_game.sh : run the asset packer to build the game
    • ./run_game.sh : run the game
  • Workflow for development:
    • ./gc.sh : run the compiler in interactive mode
    • ./gs.sh : run a goos interpreter in interactive mode
    • `./decomp.sh : run the decompiler

Coding Guidelines

  • Avoid warnings
  • Use asserts over throwing exceptions in game code (throwing exceptions from C code called by GOAL code is sketchy)

TODOs

  • Clean up header guard names (or just use #pragma once?)
  • Investigate a better config format
    • The current JSON library seems to have issues with comments, which I really like
  • Clean up use of namespaces
  • Clean up the print message when gk starts.
  • Finish commenting runtime stuff
  • Runtime document
  • GOOS document
  • Listener protocol document
  • Gtest setup for checking decompiler results against hand-decompiled stuff
  • Clean up decompiler print spam, finish up the CFG stuff
  • Decompiler document

GOAL Decompiler

The decompiler is in progress, at https://github.com/water111/jak-disassembler

Here is the plan for writing the decompiler:

  • Decode the CGO/DGO format.
  • Decode the linking data format.
  • Identify all code and disassemble
  • Recover references
  • Split code into functions, and build a graph of basic blocks
  • Create a control flow graph for each function (currently succeeds for 9857/10410 functions)
  • Extract type/method information from debug data
  • Convert instructions to an intermediate representation (IR) and eliminate GOAL/MIPS idioms
  • Regsiter liveness analysis
  • Type propagation
  • Variable map and scoping
  • S-expression construction (expression stack)

GOAL Runtime

The "runtime" will be a replacement for all of the C/C++ code of the original game. There is C/C++ code that runs on the main processor (EE) and the separate I/O processor (IOP).

  • The "C Kernel", which runs on the EE and contains
    • File I/O (for debugging, not used by retail game)
    • Initialization to boostrap the GOAL Kernel and start the game engine
    • Connection to compiler for debugging/live code patching
    • Interface to OVERLORD (see next section) for DGO loading
    • GOAL Linker
    • PS2-specific hardware initialization as required by Sony libraries
    • GOAL "kheap" allocator
    • Memory card interface
    • GOAL printf (called format) implementation
    • GOAL hash/symbol table implementation
    • Implementation of some built-in GOAL methods/functions
  • The "OVERLORD" IOP driver, which ran on the PS2's separate I/O Processor for loading things off the DVD and doing sound things
    • DGO loader
    • File system for loading from DVD or "fakeiso" mode for loading from a folder
    • "ISOThread" queue system for prioritizing reads
    • Sound stuff
    • Streaming animation stuff
    • Ramdisk stuff (implemented but totally untested)
  • The "989_snd" sound driver (no progress has been made here, the rough plan is to do a high level emulation of the sound system)
  • Sony libraries
    • SIF (interface between EE/IOP for sending data, receiving data, and making remote procedure calls)
    • IOP Kernel (single-processor non-preemptive multitasking)
    • stubs for stuff that doesn't really matter

The "Sony libraries" are a simple wrapper around my system library, which implements the threading/communication stuff.

Likely there will be sound/graphics code in here at some point, but this part is not fully planned yet.

GOAL Compiler

The GOAL compiler will target x86-64. At first just Linux. There is a macro language called GOOS which is basically just Scheme but with a few bonus features.

The compiler will reuse a significant amount of code from my existing LISP compiler for x86-64. I have a very bad WIP combination which is capable of building a modified gkernel.gc for x86 as a proof of concept. It can create and run functions in threads.

An important part of the compiler is the test suite. Without tests the compiler will be full of bugs. So every feature should have a good set of tests.

Asset Extraction Tool

Not started yet. The simplest version of this tool is just to use the decompiler logic to turn the level/art-group/texture/TXT files into GOAL source code, and copy all STR/sound/visibility files, as these don't need to be modified.

Eventually this should export to a more useful format.

File formats:

  • Art group (a GOAL object format)
    • There may be more formats related to art groups.
  • Texture page (a GOAL object format)
    • Texture page directory (a GOAL object format)
  • Level (vis-bt) (a GOAL object format)
  • TEXT/*.TXT (text, a GOAL object format)
  • MUS (sequenced music)
  • SBK (sound bank)
  • STR (streaming animation)
  • VAG (ADPCM audio)
  • VIS (visibility data bitstream)
  • Loading screen image
  • save game icon (I do not care about this)

Asset Packing Tool

Packs together all assets/compiled code/runtime into a format that can be played. The simplest version to go with the simplest extraction tool will just pass the level/art-group/texture/TXT files to the compiler, and copy STR/sound/visbility files into the fakeiso. Then pack in CGOs/DGOs.

It's important that the asset extraction/packing can be automated so we can avoid distributing the assets, which are large and probably not supposed to be distributed.