Similarity of Binaries through re-Optimization (PLDI 2017 - PLDI Research Papers)

Who

Yaniv David, Nimrod Partush, Eran Yahav

Track

PLDI 2017 PLDI Research Papers

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 19 Jun 2017 11:15 - 11:40 at Actes, Civil Engineering - Learning and Probabilistic Chair(s): Swarat Chaudhuri

Abstract

We present a scalable approach for establishing similarity
between stripped binaries (with no debug information). The main challenge is to establish similarity even when the code has been compiled using different compilers, with different optimization levels, or has been modified. Overcoming this challenge, while avoiding false positives, is invaluable to the process of reverse engineering, locating vulnerable code, and identifying \ac{IP} theft and plagiarism.

Finding similarity in binaries presents a natural tradeoff between the scalability of the approach, and its ability to identify semantic similarity which is crucial for precision. Previous techniques have been mostly heavily biased towards one of the ends of this spectrum. We present a technique that is scalable, precise and architecture-agnostic. It works by decomposing binary procedures to comparable segments, lifting segments to a \emph{canonical, optimized form} which allows for efficient semantic comparison, and then focusing comparisons on segments that are \emph{statistically significant} for establishing similarity.

We have implemented our technique in a tool called GitZ and performed an extensive evaluation. We show that GitZ is able to perform millions of comparisons efficiently, and find similarity with high accuracy.

Yaniv David

Technion

Nimrod Partush

Technion

Eran Yahav

Technion

Israel

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 19 Jun
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

10:50 - 12:30	Learning and ProbabilisticPLDI Research Papers at Actes, Civil Engineering Chair(s): Swarat Chaudhuri Rice University

10:50 25m Talk		DemoMatch: API Discovery from Demonstrations PLDI Research Papers Kuat Yessenov MIT, Ivan Kuraj MIT CSAIL, USA, Armando Solar-Lezama MIT CSAIL Media Attached
11:15 25m Talk		Similarity of Binaries through re-Optimization PLDI Research Papers Yaniv David Technion, Nimrod Partush Technion, Eran Yahav Technion
11:40 25m Talk		Synthesizing Program Input Grammars PLDI Research Papers Osbert Bastani Stanford University, Rahul Sharma Microsoft Research, Alex Aiken Stanford University, Percy Liang Stanford University Media Attached
12:05 25m Talk		Compiling Markov Chain Monte Carlo Algorithms for Probabilistic Modeling PLDI Research Papers Daniel Huang Harvard University, Jean-Baptiste Tristan Oracle Labs, Greg Morrisett Cornell University Media Attached