SharePoint's OCR will finally support hybrid PDFs (images and text)
Previously, it supported only image-based PDFs.
SharePoint’s OCR (Optical Character Recognition), which extracts printed or handwritten text from images and documents, will finally support hybrid PDFs, which contain both text and images.
SharePoint users have long requested this support, and according to the latest entry in the Microsoft 365 Roadmap, the Redmond-based tech giant should enhance the platform with it this month.
Here’s what the entry says:
SharePoint OCR feature now extends support to hybrid PDFs, which contain both images and text. Previously, OCR was limited to image-only PDFs, but with this update, you can seamlessly extract and utilize text from hybrid documents.
In SharePoint, OCR currently supports the following formats: .bmp, .png, .jpeg, .jpg, .jfif, .arw, .cr2, .crw, .erf, .gif, .mef, .mrw, .nef, .nrw, .orf, .pef, .raw, .rw2, .rw1, .sr2, .tif, .tiff, .heic, .heif, .ari, .bay, .cap, .cr3, .dcs, .dcr, .drf, .eip, .fff, .iiq, .k25, .kdc, .mos, .ptx, .pxn, .raf, .rwl, .srf, .srw, .x3f, .dng, and .pdf (image only).
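For admins who want to check which files in a library fall within that list, a minimal Python sketch might look like the following. The names `OCR_EXTENSIONS` and `is_ocr_candidate` are hypothetical and not part of any SharePoint API; the set is simply copied from the list above, and the check looks only at the file extension, so it cannot tell an image-only PDF from a hybrid one.

```python
from pathlib import PurePath

# Extensions copied from the article's list; hypothetical helper data,
# not an official SharePoint or Microsoft 365 API.
OCR_EXTENSIONS = frozenset({
    ".bmp", ".png", ".jpeg", ".jpg", ".jfif", ".arw", ".cr2", ".crw",
    ".erf", ".gif", ".mef", ".mrw", ".nef", ".nrw", ".orf", ".pef",
    ".raw", ".rw2", ".rw1", ".sr2", ".tif", ".tiff", ".heic", ".heif",
    ".ari", ".bay", ".cap", ".cr3", ".dcs", ".dcr", ".drf", ".eip",
    ".fff", ".iiq", ".k25", ".kdc", ".mos", ".ptx", ".pxn", ".raf",
    ".rwl", ".srf", ".srw", ".x3f", ".dng", ".pdf",
})


def is_ocr_candidate(filename: str) -> bool:
    """Return True if the file's extension appears in the listed formats.

    The comparison is case-insensitive and uses only the final suffix,
    so the file's actual contents are never inspected.
    """
    return PurePath(filename).suffix.lower() in OCR_EXTENSIONS


if __name__ == "__main__":
    for name in ("scan.TIFF", "report.pdf", "notes.docx"):
        print(name, is_ocr_candidate(name))
```

A frozen set keeps the lookup fast and makes it obvious that the list is fixed data rather than something the script discovers at runtime.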
In other news, SharePoint will also be enhanced with Authoring Copilot, allowing users to use AI to create high-quality web pages.