How do I extract images from a docx (word) file?

I am trying to extract images for processing from a docx (Word) file. How do I do this?

Answers (2)

Stephen23 on 21 May 2019
Edited: Stephen23 on 21 May 2019
All Office Open XML formats (e.g. .docx, .xlsx, etc.) consist of XML files and supporting files zipped together into one file. You can access the image files directly:
  1. Unzip the .docx into a folder (e.g. with the same name as the document).
  2. Open that folder, browse to the subfolder word\media to find all of the image files.
Use a reliable tool for unzipping, e.g. 7-Zip, in which case the first step is trivial: right-click the document in Windows Explorer, select "7-Zip", and choose an extract option from its menu.
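The two steps above can also be scripted. The following is a minimal sketch in Python (not MATLAB) using only the standard-library zipfile module; the function name extract_docx_images and its arguments are hypothetical, and it assumes the images sit under word/media as described above.

```python
import zipfile
from pathlib import Path


def extract_docx_images(docx_path, out_dir):
    """Copy every file stored under word/media/ in a .docx into out_dir.

    Hypothetical helper for illustration; returns the list of saved paths.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    saved = []
    # A .docx is an ordinary zip archive, so it opens without renaming.
    with zipfile.ZipFile(docx_path) as archive:
        for name in archive.namelist():
            # Entry names inside a zip use forward slashes on every OS.
            if name.startswith("word/media/") and not name.endswith("/"):
                target = out_dir / Path(name).name
                target.write_bytes(archive.read(name))
                saved.append(target)
    return saved
```

Calling extract_docx_images("report.docx", "report_images") with a hypothetical document name would write the embedded images into the report_images folder, leaving the rest of the archive untouched.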
  1 Comment
Walter Roberson on 21 May 2019
It is also possible to use Java methods to pull an entry from a zip archive without unzipping the whole thing. Jan posted some code about a week ago that used that interface but it might have been in a resurrected older question.
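The idea of pulling a single entry out of the archive without unzipping everything can be illustrated as follows. This is not the Java code referred to above; it is a minimal Python sketch of the same idea using the standard-library zipfile module, and the function names list_docx_media and read_docx_entry are hypothetical.

```python
import zipfile


def list_docx_media(docx_path):
    """Return the names of the entries stored under word/media/."""
    with zipfile.ZipFile(docx_path) as archive:
        return [
            name
            for name in archive.namelist()
            if name.startswith("word/media/") and not name.endswith("/")
        ]


def read_docx_entry(docx_path, entry_name):
    """Return the bytes of one entry; nothing is written to disk."""
    with zipfile.ZipFile(docx_path) as archive:
        return archive.read(entry_name)
```

Only the requested entry is decompressed, so the image bytes can be handed straight to an image decoder without creating a temporary folder.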



Gagan Bhangu on 29 Jul 2020
Yes, you can easily extract all images from a Word document.
Method 1: Save your docx file as a web page.
Open the docx file in Microsoft Word, click File > Save As, and select the web page (.html) option from the "Save as type" list.
After that, you will find the HTML file and a folder containing the images in the location where you saved it on your PC.
You can also use Google Docs. In Docs, click File > Download and select the web page option.
Method 2: Right-click the image.
You can save images one by one from MS Word by right-clicking an image and selecting the "Save as Picture" option.
Method 3: Open the .docx file as a .zip.
Change the file extension from .docx to .zip, open the zip file, and extract the media folder (found inside the word folder).
  1 Comment
Walter Roberson on 29 Jul 2020
The question is more whether it is possible to use MATLAB to do the work.


Release

R2019a
