RetDec is a retargetable machine-code decompiler based on LLVM.
Go to file
Petr Zemek 848cd4b6a3 Mention that Flex >= 2.6 is needed in README.
Some of our users reported that version 2.5 does not work for them.
2018-01-01 13:37:18 +01:00
cmake Remove a forgotten variable from install-share.sh that causes installation to fail (#63). 2017-12-21 07:31:39 +01:00
deps Updated fnc-patterns submodule 2017-12-28 17:06:54 +01:00
doc doc: add graph of git submodule dependencies. 2017-12-20 16:01:26 +01:00
include Initial commit. 2017-12-12 18:05:30 +01:00
scripts Cosmetic fixes and style unifications in CMakeLists.txt files. 2017-12-25 07:56:16 +01:00
src Allow 32b Windows versions of bin2llvmir and llvmir2hll handle larger addresses (#73). 2017-12-31 10:16:26 +01:00
tests Cosmetic fixes and style unifications in CMakeLists.txt files. 2017-12-25 07:56:16 +01:00
.gitignore All all build* directories into .gitignore. 2017-12-25 07:40:54 +01:00
.gitmodules Initial commit. 2017-12-12 18:05:30 +01:00
CHANGELOG.md Unify formatting in CHANGELOG. 2017-12-31 10:39:23 +01:00
CMakeLists.txt Cosmetic fixes and style unifications in CMakeLists.txt files. 2017-12-25 07:56:16 +01:00
Dockerfile Add development Dockerfile 2017-12-28 11:29:32 -05:00
Dockerfile.dev Add development Dockerfile 2017-12-28 11:29:32 -05:00
LICENSE Initial commit. 2017-12-12 18:05:30 +01:00
LICENSE-THIRD-PARTY Initial commit. 2017-12-12 18:05:30 +01:00
README.md Mention that Flex >= 2.6 is needed in README. 2018-01-01 13:37:18 +01:00

RetDec

RetDec is a retargetable machine-code decompiler based on LLVM.

The decompiler is not limited to any particular target architecture, operating system, or executable file format:

  • Supported file formats: ELF, PE, Mach-O, COFF, AR (archive), Intel HEX, and raw machine code.
  • Supported architectures (32b only): Intel x86, ARM, MIPS, PIC32, and PowerPC.

Features:

  • Static analysis of executable files with detailed information.
  • Compiler and packer detection.
  • Loading and instruction decoding.
  • Signature-based removal of statically linked library code.
  • Extraction and utilization of debugging information (DWARF, PDB).
  • Reconstruction of instruction idioms.
  • Detection and reconstruction of C++ class hierarchies (RTTI, vtables).
  • Demangling of symbols from C++ binaries (GCC, MSVC, Borland).
  • Reconstruction of functions, types, and high-level constructs.
  • Integrated disassembler.
  • Output in two high-level languages: C and a Python-like language.
  • Generation of call graphs, control-flow graphs, and various statistics.

For more information, check out our

Installation and Use

Currently, we support only Windows (7 or later), Linux, and unofficially macOS.

Warning: Decompilations of larger binaries (1 MB or more) may require a lot of RAM. When running decompilations, we advise you to limit the maximal virtual memory for processes before decompiling to prevent potential swapping and unresponsiveness. On Linux, you can run e.g. ulimit -Sv 9863168 in your shell to limit the maximal virtual memory to 8 GB.

Windows

  1. Either download and unpack a pre-built package from the following list, or build and install the decompiler by yourself (the process is described below):

  2. Install Microsoft Visual C++ Redistributable for Visual Studio 2015.

  3. Install MSYS2 and other needed applications by following RetDec's Windows environment setup guide.

  4. Now, you are all set to run the decompiler. To decompile a binary file named test.exe, go into $RETDEC_INSTALL_DIR/bin and run:

    bash decompile.sh test.exe
    

    For more information, run bash decompile.sh --help.

Linux

  1. There are currently no pre-built packages for Linux. You will have to build and install the decompiler by yourself. The process is described below.

  2. After you have built the decompiler, you will need to install the following packages via your distribution's package manager:

  3. Now, you are all set to run the decompiler. To decompile a binary file named test.exe, go into $RETDEC_INSTALL_DIR/bin and run:

    ./decompile.sh test.exe
    

    For more information, run ./decompile.sh --help.

macOS

Warning: macOS build was added based on community feedback and is not directly supported by the RetDec team. We do not guarantee you that these instructions will work for you. If you encounter any problem with your build, submit an issue so the macOS community can help you out.

  1. There are currently no pre-built packages for macOS. You will have to build and install the decompiler by yourself. The process is described below.

  2. After you have built the decompiler, you will need to install the following packages:

  3. Now, you are all set to run the decompiler. To decompile a binary file named test.exe, go into $RETDEC_INSTALLED_DIR/bin and run:

    # /usr/local/bin/bash if installed via Homebrew
    /path/to/gnu/bash ./decompile.sh test.exe
    

    For more information, run ./decompile.sh --help.

Build and Installation

Build in Docker

Build image

Building in Docker doesn't require installation of required libraries locally. This is a good option for trying out retdec without setting up the whole build toolchain.

To build the retdec docker image, run

docker build -t retdec .

This builds the container from the master branch of this repository.

To build the container using the local copy of the repository, fully clone the repository:

git submodule update --init --recursive

Then build the container using the development Dockerfile, Dockerfile.dev:

docker build -t retdec:dev . -f Dockerfile.dev

Run container

To decompile a binary, create a container to upload the binary to:

docker create --name retdec_init retdec

Upload the binary: Note the destination directory should be a directory with read/write permissions like /home/retdec/.

docker cp <file> retdec_init:/destination/path/of/binary

Commit the copied files into the container image:

docker commit retdec_init retdec:initialized

Run the decompiler:

docker run --name retdec retdec:initialized decompile.sh /destination/path/of/binary

Copy output back to host:

docker cp retdec:/destination/path/of/binary.c /path/to/save/file

Build and install locally

This section describes a manual build and installation of RetDec.

Requirements

Linux

On Debian-based distributions (e.g. Ubuntu), the required packages can be installed with apt-get:

sudo apt-get install build-essential cmake git perl python3 bash coreutils wget bc doxygen graphviz upx flex bison zlib1g-dev libtinfo-dev autoconf automake pkg-config m4 libtool

On RPM-based distributions (e.g. Fedora), the required packages can be installed with dnf:

sudo dnf install git cmake make gcc gcc-c++ perl python3 bash zlib-devel flex bison m4 coreutils autoconf automake libtool ncurses-devel wget bc doxygen graphviz upx pkg-config

On Arch Linux, the required packages can be installed with pacman:

sudo pacman -S base-devel cmake git perl python3 bash coreutils wget bc doxygen graphviz upx flex bison zlib ncurses autoconf automake pkg-config m4 libtool

Windows

  • Microsoft Visual C++ (version >= Visual Studio 2015 Update 2)
  • Git
  • MSYS2 and some other applications. Follow RetDec's Windows environment setup guide to get everything you need on Windows.
  • Active Perl. It needs to be the first Perl in PATH, or it has to be provided to CMake using CMAKE_PROGRAM_PATH variable, e.g. -DCMAKE_PROGRAM_PATH=/c/perl/bin.
  • Python (version >= 3.4)

macOS

  • Full Xcode installation (Command Line Tools are untested)
  • CMake (version >= 3.6)
  • Newer versions of Bison and Flex, preferably installed via Homebrew
  • coreutils (ensure that you have $(brew --prefix coreutils)/libexec/gnubin in your PATH)
  • wget
  • Python (version >= 3.4, macOS has 2.7)

Process

Warning: Currently, RetDec has to be installed into a clean, dedicated directory. Do NOT install it into /usr, /usr/local, etc. because our build system is not yet ready for system-wide installations. So, when running cmake, always set -DCMAKE_INSTALL_PREFIX=<path> to a directory that will be used just by RetDec. For more details, see #12.

  • Recursively clone the repository (it contains submodules):
    • git clone --recursive https://github.com/avast-tl/retdec
  • Linux:
    • cd retdec
    • mkdir build && cd build
    • cmake .. -DCMAKE_INSTALL_PREFIX=<path>
    • make && make install
  • Windows:
    • Open MSBuild command prompt, or any terminal that is configured to run the msbuild command.
    • cd retdec
    • mkdir build && cd build
    • cmake .. -DCMAKE_INSTALL_PREFIX=<path> -G<generator>
    • msbuild /m /p:Configuration=Release retdec.sln
    • msbuild /m /p:Configuration=Release INSTALL.vcxproj
    • Alternatively, you can open retdec.sln generated by cmake in Visual Studio IDE.
  • macOS:
    • cd retdec
    • mkdir build && cd build
    • # Apple ships old Flex & Bison, so Homebrew versions should be used.
      export CMAKE_INCLUDE_PATH="/usr/local/opt/flex/include"
      export CMAKE_LIBRARY_PATH="/usr/local/opt/flex/lib;/usr/local/opt/bison/lib"
      export PATH="/usr/local/opt/flex/bin:/usr/local/opt/bison/bin:$PATH"
      
    • cmake .. -DCMAKE_INSTALL_PREFIX=<path>
    • make && make install

You have to pass the following parameters to cmake:

  • -DCMAKE_INSTALL_PREFIX=<path> to set the installation path to <path>.
  • (Windows only) -G<generator> is -G"Visual Studio 14 2015" for 32-bit build using Visual Studio 2015, or -G"Visual Studio 14 2015 Win64" for 64-bit build using Visual Studio 2015. Later versions of Visual Studio may be used.

You can pass the following additional parameters to cmake:

  • -DRETDEC_DOC=ON to build with API documentation (requires Doxygen and Graphviz, disabled by default).
  • -DRETDEC_TESTS=ON to build with tests, including all the tests in dependency submodules (disabled by default).
  • -DCMAKE_BUILD_TYPE=Debug to build with debugging information, which is useful during development. By default, the project is built in the Release mode. This has no effect on Windows, but the same thing can be achieved by running msbuild with the /p:Configuration=Debug parameter.
  • -DCMAKE_PROGRAM_PATH=<path> to use Perl at <path> (probably useful only on Windows).

Repository Overview

This repository contains the following libraries:

  • bin2llvmir -- library of LLVM passes for translating binaries into LLVM IR modules.
  • debugformat -- library for uniform representation of DWARF and PDB debugging information.
  • dwarfparser -- library for high-level representation of DWARF debugging information.
  • llvm-support -- set of LLVM related utility functions.
  • llvmir2hll -- library for translating LLVM IR modules to high-level source codes (C, Python-like language).

This repository contains the following tools:

  • bin2llvmirtool -- frontend for the bin2llvmir library.
  • llvm2hlltool -- frontend for the llvmir2hll library.

This repository contains the following scripts:

  • decompile.sh -- the main decompilation script binding it all together. This is the tool to use for full binary-to-C decompilations.
  • Support scripts used by decompile.sh:
    • color-c.py -- decorates output C sources with IDA color tags -- syntax highlighting for IDA.
    • config.sh -- decompiler's configuration file.
    • decompile-archive.sh -- decompiles objects in the given AR archive.
    • fileinfo.sh -- a Fileinfo tool wrapper.
    • signature-from-library.sh -- extracts function signatures from the given library.
    • unpack.sh -- tries to unpack the given executable file by using any of the supported unpackers.
  • Other utility scripts:
    • decompile-all.sh -- decompiles all executables in the given directory and subdirectories.
    • run-unit-test.sh -- run all tests in the unit test directory.
    • utils.sh -- a collection of bash utilities.
  • retdec-idaplugin -- embeds RetDec into IDA (Interactive Disassembler) and makes its use much easier.
  • retdec-regression-tests-framework -- provides means to run and create regression tests for RetDec and related tools. This is a must if you plan to contribute to the RetDec project.
  • retdec-python -- Python library and tools providing easy access to our online decompilation service through its REST API.
  • vim-syntax-retdecdsm -- Vim syntax-highlighting file for the output from the RetDec's disassembler (.dsm files).

License

Copyright (c) 2017 Avast Software, licensed under the MIT license. See the LICENSE file for more details.

RetDec uses third-party libraries or other resources listed, along with their licenses, in the LICENSE-THIRD-PARTY file.

Contributing

See RetDec contribution guidelines.

Acknowledgements

This software was supported by the research funding TACR (Technology Agency of the Czech Republic), ALFA Programme No. TA01010667.