English 中文(简体)
Reverse Engineering File Formats using AI Techniques
原标题:

This is to extend the question: Tools to help reverse engineer binary file formats

Are there any tools that are publicly available that uses clustering and/or data mining techniques to reverse engineer file formats?

For example, with the tool you would have a collection of files that have the same format and the output of the tool would be the generic structure?

最佳回答

If one had a truly efficient binary encoding format (ZIP files are an example), then the information content in each bit is high. Essentially, it will look like a perfect random number.

You can t infer anything from that without additional knowledge.

If the binary encoding isn t efficient, in theory, you have some faint chance of seeing structure. But this still sounds really hard; how do you even begin guessing where the boundaries of fields are?

The AI machine learning types will tell you, you can t learn anything unless you already "almost" know it. Often they succeed by encoding the the problem with problem-tokens that at least you can reason about.

I don t think you can do this without providing more information. Do you know anything about the file formats? Field sizes are always less than N bits? Only ASCII strings are encoded or vice versa?

问题回答

暂无回答




相关问题
Parsing RealLegal s .PTX file format

Anyone have experience parsing the RealLegal PTX file format? I m interested in creating a Mac reader client for this proprietary format. The PTX file format is in wide use for storing court and ...

Semantic (or symbolic) Representation of Music [closed]

I m looking to create a little app that allows the user to visual and control (mostly) classical music, in order to better recognize structures such as canon. What would be an appropriate data and ...

Can a CSV file have a comment?

Is there any official way to allow a CSV formatted file to allow comments, either on its own line OR at the end of a line? I tried checking wikipedia on this and also RFC 4180 but both do not mention ...

How to read Lotus Notes mail archives (*.nsf)

Does anyone know how to read these files without using the interops or COM interaction? Just the direct way. Is there any spec of this format or reverse engineered stuff that could help on this? ...

Reverse Engineering File Formats using AI Techniques

This is to extend the question: Tools to help reverse engineer binary file formats Are there any tools that are publicly available that uses clustering and/or data mining techniques to reverse ...

Identify file format from header?

I got some files , which has header, {AD08E961-F492-11d1-B0EA-00AA00683350}2.00:en_US:45015:021 Can you please help me in finding out, from which software it is created. Only information I have is,...

libsvm model file format

According to this FAQ the model format in libsvm should be straightforward. And in fact it is, when I call just svm-train. As an example, the first SV for the a1a dataset is 1 3:1 11:1 14:1 19:1 39:...

热门标签