Commit Graph

27 Commits

Author SHA1 Message Date
Ronald Caesar
3cf17ac088 tools: fix CDoc ordering of types
Fixes CDoc ordering the sidebar and main page types in alphabetical
order. Only the sidebar should be sorted and the main page should
be rendered the way its header file was parsed, top to bottom.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2026-01-17 02:06:36 -04:00
Ronald Caesar
3f78168ce5 tools: create a rustdoc like documentation generator
While I do not like the Rust Language as a whole, their documentation
generator is the best I've ever seen. in any language. I want to
implement something like it for Ballistic.

Like I said in the README, I have absolutely zero motivation to create
a documentation generator so `cdoc.c` is made completely with AI. The
code is messy but the generated HTML files look beautiful.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2026-01-16 20:23:12 -04:00
Ronald Caesar
4cafd1bf67 docs: add ballistic cli section
Signed-off-by: Ronald Caesar <github43132@proton.me>
2026-01-14 23:37:59 -04:00
Ronald Caesar
de8e6e5a4d tools: add ballistic cli program
This program is used to test Ballistic engine's translation logic.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2026-01-14 23:28:28 -04:00
Ronald Caesar
f72da3e121 decoder: add ir opcode to metadata struct
Instead of using strcmp() on each decoded intruction's mnemonic to
translate it, we embedd an IR opcode into the struct. This is a very
barebones implementation and does not cover the entire ARM instruction
set. ARM instructions that does not have an IR opcode equivalent will be
marked with `OPCODE_TRAP` and should be implemented in the future.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2026-01-13 23:31:06 -04:00
Ronald Caesar
77d3e7f5cc tools: add coverage cli program
A simple program that prints to stdout the top 20 most common
instructions in an ARM64 binary file.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2026-01-11 01:51:53 -04:00
Ronald Caesar
a2981ec38e decoder/tools: Add comment for struct attribute array_index
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:44 -04:00
Ronald Caesar
4aa8335612 decoder: Make decoder API public
The decoder API is now suitable to be made public. decoder.h is the sole
entry point for the decoder and it has been moved to `include/`

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:43 -04:00
Ronald Caesar
8fbfbe0cdc tools: Add readme for tools folder
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:43 -04:00
Ronald Caesar
3ae2e5dfae decoder/tools: Add type hinting to scripts
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:42 -04:00
Ronald Caesar
f19cd7cc78 decoder/tools: Assign instruction indices seperatly
Assign indices in generate_a64_table.py at the very end, right before
generation. This is much safer than maintaining a running counter during
parsing.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:42 -04:00
Ronald Caesar
3c28b8f3e8 decoder/tools: Sort instructions in a bucket by priority
This ensures specific instructions are checked before generic ones.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:42 -04:00
Ronald Caesar
f2c923cad2 decoder: Implement hashing algorithm
Switch to hashing bits [32:21] instead of bits [27:20] and [7:4]. This
massively reduces the average amount of instructions in a bucket.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:41 -04:00
Ronald Caesar
2637ba6ed0 decoder: shrink the very bloated hash table array
512 instructions are being declared in every bucket.

Size per bucket:
512 x 8 bytes (ptr) = 4096 bytes

Total Table Size:
4096 buckets x 4096 bytes ≈ 16.7 MB

This will destroy the CPU cache performance.

To fix this we generate small seperate arrays for each bucket that
actually has content and point to them.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:41 -04:00
Ronald Caesar
1dce881f33 build: Add generate_a64_table.py to cmake
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:41 -04:00
Ronald Caesar
1cee684f0b decoder: Replace linear search with hash table
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:40 -04:00
Ronald Caesar
c24f4f6e80 tests: Add decoder fuzzer
I realized that a fuzzer made in python is way slower than a fuzzer in
C. So here you go.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:40 -04:00
Ronald Caesar
9f1a2a63ed decoder/tools: Add fuzzer script
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:40 -04:00
Ronald Caesar
d988272544 decoder/tools: Fix flawed decoder table generation logic
generate_a64_table.py only generated instruction variants. For
example, the instruction ADD has forms. The default form with bit 31
set to 0, and its variant with bit 31 set to 1. The script ignored the
default form and added the varient.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:39 -04:00
Ronald Caesar
99e91b4921 decoder/tools: Add decoder_cli program
This program takes a 32-bit hexidecimal representing an ARM instruction
and prints its corresponding mnemonic. This program will be used in
tandem with a python fuzzing script to verify the decoder table
generated by tools/generate_a64_table.py.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:39 -04:00
Ronald Caesar
2bea3a817f decoder/tools: Made python script more user-friendly
- Added default values for input/output directories
- Improve error handling for missing cli arguments

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:38 -04:00
Ronald Caesar
e0c76b5361 decoder: Implement linear search decoder
This will be replace with a hash table but I need a working
decoding implementation right now.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:38 -04:00
Ronald Caesar
7c62fb2d21 decoder: Rename arm64 global instructions array
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:37 -04:00
Ronald Caesar
c25abe080d decoder/tools: Generate arm64 instruction size constant
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:37 -04:00
Ronald Caesar
4e42c9050b decoder/tools: Generate decoder table header file
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:36 -04:00
Ronald Caesar
fe07956d6b decoder/tools: Add CLI args and more erorr handling
Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:36 -04:00
Ronald Caesar
0fee614994 decoder: Implement initial A64 instruction decoder
Adds a python script, tools/generate_a64_table.py, to parse ARM's
machine readable XML. The script generates a static C lookup table
containing instruction mnemonics, masks, and values.

Signed-off-by: Ronald Caesar <github43132@proton.me>
2025-12-12 18:11:36 -04:00