Hi, On Tue, 13 Feb 2007, Linus Torvalds wrote: > 0x00 - which is the only one mmfile_is_binart() uses - is arguably the > "most binary" one, of course, but it might be interesting to give > different weights to the whole range.. In particular, especially for > small files, the fact that there is no 0x00 byte in no way indicates > that it's not "binary". Last time I checked, the text files never had lines longer than 200 characters (I chose this intentionally large). So, it might be a good heuristic to check the maximal line length, and refuse to believe that it's text once a certain (configurable) threshold is reached. Ciao, Dscho - To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.htmlReceived on Wed Feb 14 22:10:32 2007
This archive was generated by hypermail 2.1.8 : 2007-02-14 22:12:25 EST