NANETH TIFFs

11
NANETH TIFFs “TIFF stands for Thousands of Incompatible File Formats

description

NANETH TIFFs. “TIFF stands for  Thousands of Incompatible File Formats ”. ...0000 0000 1 000 0000… …0000 0000 0 1 00 0000…. ...0000 0000 1 000 0000…. OR. …0000 0000 0 1 00 0000…. What does “valid” mean?. Well-formed? These images are! Verified by external tool? These images are! - PowerPoint PPT Presentation

Transcript of NANETH TIFFs

Page 1: NANETH TIFFs

NANETHTIFFs

“TIFF stands for Thousands of Incompatible File Formats”

Page 2: NANETH TIFFs
Page 3: NANETH TIFFs
Page 4: NANETH TIFFs
Page 5: NANETH TIFFs

...0000 0000 1000 0000…

…0000 0000 0100 0000…

Page 6: NANETH TIFFs

…0000 0000 0100 0000…

...0000 0000 1000 0000…

OR

Page 7: NANETH TIFFs

What does “valid” mean?

• Well-formed?– These images are!

• Verified by external tool?– These images are!

• Confirmed as matching spec?– Who knows?

• Internally consistent?– These images are definitely NOT!

Page 8: NANETH TIFFs

How do we detect it?

$ exiftool -n -if '$FileSize# * 8 < $ImageWidth * $ImageHeight * $BitsPerSample' -p '$filename is TOO SMALL - $FileSize - ($ImageWidth*$ImageHeight*$BitsPerSample)' *.tif

NL-HaNA_2.24.01.09_0_901-2649.tif is TOO SMALL - 9952834 - (3454*2877*16)

NL-HaNA_2.24.01.09_0_901-2809.tif is TOO SMALL - 9952834 - (3454*2877*16)

NL-HaNA_2.24.01.09_0_901-3431.tif is TOO SMALL - 9809944 - (2859*3425*16)

NL-HaNA_2.24.01.09_0_901-4419.tif is TOO SMALL - 9807680 - (3425*2859*16)

NL-HaNA_2.24.01.09_0_901-4451.tif is TOO SMALL - 9807680 - (3425*2859*16)

NL-HaNA_2.24.01.09_0_901-5197.tif is TOO SMALL - 9809944 - (2859*3425*16)

NL-HaNA_2.24.01.09_0_901-5556.tif is TOO SMALL - 9807680 - (3425*2859*16)

BUT: As it stands, this only works on grayscale TIFFs

Page 9: NANETH TIFFs

How do we FIX it?

$ exiftool -BitsPerSample=8 NL-HaNA_2.24.01.09_0_901-5556.tif

AGAIN: As it stands, this only works on grayscale TIFFs

Page 10: NANETH TIFFs

What do we take away?

• Don’t always assume your vendors/digitisers are doing the job right.

• Don’t always assume that “successful validation” is meaningful. (Also: learn the limitations of your tools.)

• The only thing better than double-checking is triple-checking.

• KNOW WHAT YOU ARE “PRESERVING”!

Page 11: NANETH TIFFs

One last time…