SharePoint's OCR will finally support hybrid PDFs (images and text)

Previously it only supported image-based PDF.

Reading time icon 1 min. read


Readers help support Windows Report. We may get a commission if you buy through our links. Tooltip Icon

Read our disclosure page to find out how can you help Windows Report sustain the editorial team. Read more

SharePoint OCR

SharePoint’s OCR (Optical Character Recognition), which extracts printed or handwritten text from images and documents, will finally support the PDF format, which includes text and images.

SharePoint users have long requested this support, and according to the latest entry in the Microsoft 365 Roadmap, the Redmond-based tech giant should enhance the platform with it this month.

Here’s what the entry says:

SharePoint OCR feature now extends support to hybrid PDFs, which contain both images and text. Previously, OCR was limited to image-only PDFs, but with this update, you can seamlessly extract and utilize text from hybrid documents.

In SharePoint, OCR currently supports the following formats: .bmp, .png, .jpeg, .jpg, .jfif, .arw, .cr2, .crw, .erf, .gif, .mef, .mrw, .nef, .nrw, .orf, .pef, .raw, .rw2, .rw1, .sr2, .tif, .tiff, .heic, .heif, .ari, .bay, .cap, .cr3, .dcs, .dcr, .drf, .eip, .fff, .iiq, .k25, .kdc, .mef, .mos, .ptx, .pxn, .raf, .rwl, .sr2, .srf, .srw, .x3f, .dng, .tiff, and .pdf (image only).

In other news, SharePoint will also be enhanced with Authoring Copilot, allowing users to use AI to create high-quality web pages.

More about the topics: microsoft, Sharepoint

User forum

0 messages