Manipulating racket bytecode
ZORDOZ speaks to you! His chosen ones.
This is an explorer for Racket .zo files.
Tested to work on Racket v6.5
For compatibility with older versions, see the
v6.1
and
v6.2
and
v6.3
branches of this repo.
(Note that installing through raco will choose the available version, if any, matching your Racket install.)
Typed Racket users can (require zordoz/typed) for type-annotated bindings from zordoz.
Also, zordoz/typed/zo-structs is a safe wrapper around Racket’s compiler/zo-structs library.
You have two options.
raco by running raco pkg install zordozraco:git clone https://github.com/bennn/zordoz; raco pkg install zordoz/To run tests, do raco test zordoz.
Tests are located in the test submodule of each source file.
Zordoz provides a raco zordoz command.
Activate the REPL by giving a path to a compiled file.
raco zordoz FILE.zo
Passing the -t option uses typed racket code.
Beware, the typed racket version is up to 5x slower than untyped because of contracts with the compiler/zo-lib structs.
The REPL accepts the following commands:
alst prints all command aliases; for example, the repl treats the words ‘alst’ and ‘aliases’ the same wayback goes back to the previous contextdive ARG changes context to a new zo struct or list (other dives are not permitted)find ARG searches for matches to ARG and, if successful, changes context to the list of resultshelp prints information about these commandsinfo prints data about the current contextjump reverts to a previously saved contextsave marks the current context as a target for jumpquit exits the interpreterThe functions implementing the dive, find, and info commands are available outside the REPL.
Check the guide for a summary.
Running:
./zordoz -f branch -f lam -f closure FILE.zo
Will count and print the number of times the zo structs branch lam and closure appear.
This may take a while, depending on the size of the bytecode file.
You can limit the search depth by passing a natural number with the -l flag.
See the decompilation guide for a list of all zo struct names.
Racket bytecode is stored in files with a .zo extension.
This tool makes it easier to explore the bytecode representation of a file, whether or not you have access to the file’s source code.
Given a .zo file, we decompile the bytecode into a struct (aka, a “zo-struct”) using Racket’s built-in decompilation API.
The REPL loads this struct as its initial context and begins accepting commands, making it easy to visualize and explore Racket bytecode.
Suppose we create and compile a small racket file:
> echo -e "#lang racket/base\n(if #t (+ 1 1) 0)" > test.rkt> raco make test.rkt
The actual bytecode is not human readable.
Neither is the struct representation output by zo-parse:
> echo -e '#lang racket/base\n(require compiler/zo-parse)\n(call-with-input-file "compiled/test_rkt.zo"\n (lambda (fd) (displayln (zo-parse fd))))' > print-test.rkt> racket print-test.rkt#s((compilation-top zo 0) 0 #s((prefix zo 0) 0 (#f) ()) #s((mod form 0 zo 0) test test #<module-path-index> #s((prefix zo 0) 0 (#s((module-variable zo 0) #<module-path-index> print-values 0 0 #s((function-shape zo 0) #(struct:arity-at-least 0) #f))) ()) ((0 () ()) (1 () ()) (#f () ())) ((0 #<module-path-index>) (1) (-1) (#f)) (#s((apply-values expr 0 form 0 zo 0) #s((toplevel expr 0 form 0 zo 0) 0 0 #t #t) 2)) () ((0 () ())) 0 #s((toplevel expr 0 form 0 zo 0) 0 0 #f #f) #f #t () (#s((mod form 0 zo 0) (test configure-runtime) configure-runtime #<module-path-index> #s((prefix zo 0) 0 (#s((module-variable zo 0) #<module-path-index> configure 0 0 #s((function-shape zo 0) 1 #f))) ()) ((0 () ()) (1 () ()) (#f () ())) ((0 #<module-path-index> #<module-path-index>) (1) (-1) (#f)) (#s((application expr 0 form 0 zo 0) #s((primval expr 0 form 0 zo 0) 1000) (#t))) () ((0 () ())) 1 #s((toplevel expr 0 form 0 zo 0) 0 0 #f #f) #f #t () () ())) ()))
ZORDOZ offers a more readable presentation.
Below is a sample interactive session with the same small file (interspersed with commentary):
> racket zordoz.rkt compiled/test_rkt.zoINFO: Loading bytecode file 'compiled/test_rkt.zo'...INFO: Parsing bytecode...INFO: Parsing complete!--- Welcome to the .zo shell, version 0.1 'outlands' ---zo> info<struct:compilation-top>max-let-depth : 0prefix : <struct:prefix>code : <struct:mod>
The compilation-top struct is at the top of most every .zo file.
Things get more interesting as we explore the structs nested inside it.
zo> dive codezo> info<struct:mod>name : testsrcname : testself-modidx : #<module-path-index>prefix : <struct:prefix>provides : 0 [] [] 1 [] [] #f [] []requires : 0 #<module-path-index> 1 -1 #fbody : <struct:apply-values>syntax-bodies :unexported : 0max-let-depth : 0dummy : <struct:toplevel>lang-info : #finternal-context : #tflags :pre-submodules : <struct:mod>[1]post-submodules : []
The mod struct represents a Racket module.
This module has the name test; inferred from our filename test.rkt.
We could continue dive-ing into structs, or we can use the shell’s find command to look for structs matching a name like mod or compilation-top.
Let’s search for branch structs.
Maybe we can find the if-statement in our original code.
zo> find branchFIND returned 0 results
Nothing.
The if-statement has been optimized away.
Let’s try to find what it turned into by searching the body of the module.
zo> dive bodyzo> info(<struct:apply-values>)[1]
The syntax (<struct:NAME>)[LENGTH] denotes a list of zo-structs.LENGTH is the number of elements in the list—we can dive into any valid index.
zo> dive 0zo> info<struct:apply-values>proc : <struct:expr>args-expr : 2
Looks like our if-statement was optimized into a constant, 2.
Happy exploring!