English 中文(简体)
Splitting a multipage TIFF image - With .NET and IronPython
原标题:

I had a scanned multipage TIFF image and needed to split each page out into individual files.

This is easy to do in by leveraging the .NET framework and C#, but since I did not have all the development tools installed on the machine I was using, I instead opted to use IronPython (via ipy.exe) to quickly script the processing logic.

Using Stack Overflow as a blog engine, I ll provide an answer to my own question. Comments, suggestions, alternatives, etc. are welcome!

最佳回答

Here is one way to do this - tweak as needed.


import clr
clr.AddReference("System.Drawing")

from System.Drawing import Image
from System.Drawing.Imaging import FrameDimension
from System.IO import Path

# sourceFilePath - The full path to the tif image on disk (e.g path = r"C:filesmultipage.tif")
# outputDir - The directory to store the individual files.  Each output file is suffixed with its page number.
def splitImage(sourceFilePath, outputDir):
     img = Image.FromFile(sourceFilePath)

     for i in range(0, img.GetFrameCount(FrameDimension.Page)):

         name = Path.GetFileNameWithoutExtension(sourceFilePath)
         ext = Path.GetExtension(sourceFilePath)
         outputFilePath = Path.Combine(outputDir, name + "_" + str(i+1) + ext)

         frameDimensionId = img.FrameDimensionsList[0]
         frameDimension = FrameDimension(frameDimensionId)

         img.SelectActiveFrame(frameDimension, i)
         img.Save(outputFilePath, ImageFormat.Tiff)
问题回答

One downside to doing it this way is that the image data was decompressed and then re-compressed when it was saved. This is not a problem if your compression is lossless (just time and memory), but if you are using JPEG compression for the images inside the TIFF, you will lose quality.

There are ways to do this using libtiff directly -- I don t know of any other non-commercial tools that can do it. Basically, you need to find the TIFF directory entries in the file that relate to the image data and copy them directly into a new TIFF without decoding them and reencoding. Depending on how much you want to do, you may need to fix offsets in the entries (e.g. if you are also bringing over the meta-data)

If you are interested in being able to split, merge, remove pages from or reorder TIFF documents without losing quality (and also faster and using less memory), take a look at my company s product, DotImage, and look at the TiffDocument class. This CodeProject article shows how to do it.





相关问题
Manually implementing high performance algorithms in .NET

As a learning experience I recently tried implementing Quicksort with 3 way partitioning in C#. Apart from needing to add an extra range check on the left/right variables before the recursive call, ...

Anyone feel like passing it forward?

I m the only developer in my company, and am getting along well as an autodidact, but I know I m missing out on the education one gets from working with and having code reviewed by more senior devs. ...

How do I compare two decimals to 10 decimal places?

I m using decimal type (.net), and I want to see if two numbers are equal. But I only want to be accurate to 10 decimal places. For example take these three numbers. I want them all to be equal. 0....

Exception practices when creating a SynchronizationContext?

I m creating an STA version of the SynchronizationContext for use in Windows Workflow 4.0. I m wondering what to do about exceptions when Post-ing callbacks. The SynchronizationContext can be used ...

Show running instance in single instance application

I am building an application with C#. I managed to turn this into a single instance application by checking if the same process is already running. Process[] pname = Process.GetProcessesByName("...

How to combine DataTrigger and EventTrigger?

NOTE I have asked the related question (with an accepted answer): How to combine DataTrigger and Trigger? I think I need to combine an EventTrigger and a DataTrigger to achieve what I m after: when ...

热门标签