Write a Blog >>
Mon 19 Jun 2017 11:15 - 11:40 at Actes, Civil Engineering - Learning and Probabilistic Chair(s): Swarat Chaudhuri

We present a scalable approach for establishing similarity
between stripped binaries (with no debug information). The main challenge is to establish similarity even when the code has been compiled using different compilers, with different optimization levels, or has been modified. Overcoming this challenge, while avoiding false positives, is invaluable to the process of reverse engineering, locating vulnerable code, and identifying \ac{IP} theft and plagiarism.

Finding similarity in binaries presents a natural tradeoff between the scalability of the approach, and its ability to identify semantic similarity which is crucial for precision. Previous techniques have been mostly heavily biased towards one of the ends of this spectrum. We present a technique that is scalable, precise and architecture-agnostic. It works by decomposing binary procedures to comparable segments, lifting segments to a \emph{canonical, optimized form} which allows for efficient semantic comparison, and then focusing comparisons on segments that are \emph{statistically significant} for establishing similarity.

We have implemented our technique in a tool called GitZ and performed an extensive evaluation. We show that GitZ is able to perform millions of comparisons efficiently, and find similarity with high accuracy.

Mon 19 Jun

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

10:50 - 12:30
Learning and ProbabilisticPLDI Research Papers at Actes, Civil Engineering
Chair(s): Swarat Chaudhuri Rice University
10:50
25m
Talk
DemoMatch: API Discovery from Demonstrations
PLDI Research Papers
Kuat Yessenov MIT, Ivan Kuraj MIT CSAIL, USA, Armando Solar-Lezama MIT CSAIL
Media Attached
11:15
25m
Talk
Similarity of Binaries through re-Optimization
PLDI Research Papers
Yaniv David Technion, Nimrod Partush Technion, Eran Yahav Technion
11:40
25m
Talk
Synthesizing Program Input Grammars
PLDI Research Papers
Osbert Bastani Stanford University, Rahul Sharma Microsoft Research, Alex Aiken Stanford University, Percy Liang Stanford University
Media Attached
12:05
25m
Talk
Compiling Markov Chain Monte Carlo Algorithms for Probabilistic Modeling
PLDI Research Papers
Daniel Huang Harvard University, Jean-Baptiste Tristan Oracle Labs, Greg Morrisett Cornell University
Media Attached