push.rocks/smartocr

Fork 0

Go to file

Philipp Kunz 8dd9bea8bb 1.0.9

2024-04-18 16:43:32 +02:00

.gitea/workflows

fix(core): update

2024-04-18 16:43:32 +02:00

.vscode

fix(core): update

2021-11-28 23:15:14 +01:00

test

fix(core): update

2024-04-18 16:43:32 +02:00

fix(core): update

2024-04-18 16:43:32 +02:00

.gitignore

fix(core): update

2021-11-28 23:15:14 +01:00

.gitlab-ci.yml

fix(core): update

2024-04-18 16:43:32 +02:00

license

fix(core): update

2021-11-28 23:15:14 +01:00

npmextra.json

update tsconfig

2024-04-14 18:05:46 +02:00

package.json

1.0.9

2024-04-18 16:43:32 +02:00

pnpm-lock.yaml

fix(core): update

2024-04-18 16:43:32 +02:00

readme.hints.md

update tsconfig

2024-04-14 18:05:46 +02:00

readme.md

update tsconfig

2024-04-14 18:05:46 +02:00

tsconfig.json

update npmextra.json: githost

2024-04-01 21:37:04 +02:00

readme.md

@push.rocks/smartocr

an ocr module using ocrmypdf

Install

To install @push.rocks/smartocr, use the following command with npm:

npm install @push.rocks/smartocr --save

This module depends on a few external utilities like ocrmypdf, so make sure you have these installed and available in your system's PATH. Consult the ocrmypdf documentation for installation instructions suitable for your operating system.

Usage

This module provides a TypeScript interface for OCR processing of PDF documents using ocrmypdf, encapsulated in the SmartOcr class. Here's how to leverage it in your TypeScript project.

Preparing Your Project

First, ensure your TypeScript configuration is set up to handle ESModule syntax. You will also need to have Node.js and the external ocrmypdf tool installed on your system.

Basic Setup

import { SmartOcr } from '@push.rocks/smartocr';

async function runOcrOnPdf(pdfFilePath: string): Promise<Buffer> {
  // Initialize the SmartOcr instance
  const smartOcrInstance = await SmartOcr.createAndInit();

  // Load your PDF file into a Buffer, this can be from a file or even a remote source
  const pdfBuffer = await fs.promises.readFile(pdfFilePath);

  // Process the PDF Buffer through SmartOcr
  const processedBuffer = await smartOcrInstance.processPdfBuffer(pdfBuffer);

  return processedBuffer;
}

// Replace './path/to/your/document.pdf' with the actual path to the PDF document you want to OCR
const ocredPdfBuffer = await runOcrOnPdf('./path/to/your/document.pdf');

// You can now save this buffer to a file, or use it as needed in your application
await fs.promises.writeFile('./path/to/output/document_ocr.pdf', ocredPdfBuffer);

In the example above, we import the SmartOcr class and use it to process a PDF by passing a Buffer of the PDF file to the processPdfBuffer method. The method returns a Buffer of the processed PDF which includes a text layer added by OCR.

Advanced Usage

The SmartOcr class maintains an internal smartshell instance to interface with the ocrmypdf command. This setup is abstracted away, ensuring you don't need to manage or understand the underlying shell commands to use OCR functionality in your application.

Handling OCR Result

The result of the processPdfBuffer is a Buffer that contains the OCR-processed PDF. This buffer can be directly written to a file system or further manipulated in memory, depending on your application's needs.

Error Handling

It's important to handle errors that may arise from reading files or the OCR process. The OCR process depends on the external ocrmypdf utility, so errors can occur if the utility encounters unsupported PDF structures or if there are issues with the installation of ocrmypdf.

try {
  const ocredPdfBuffer = await runOcrOnPdf('./path/to/your/document.pdf');
  await fs.promises.writeFile('./path/to/output/document_ocr.pdf', ocredPdfBuffer);
} catch (error) {
  console.error('Failed to OCR the document:', error);
}

Conclusion

The @push.rocks/smartocr library simplifies adding OCR capabilities to your TypeScript applications by abstracting away the complexity of interfacing with ocrmypdf. With minimal setup, you can start processing PDF documents to add searchable text layers, making this library a valuable tool for any project that requires OCR functionality.

License and Legal Information

This repository contains open-source code that is licensed under the MIT License. A copy of the MIT License can be found in the license file within this repository.

Please note: The MIT License does not grant permission to use the trade names, trademarks, service marks, or product names of the project, except as required for reasonable and customary use in describing the origin of the work and reproducing the content of the NOTICE file.

Trademarks

This project is owned and maintained by Task Venture Capital GmbH. The names and logos associated with Task Venture Capital GmbH and any related products or services are trademarks of Task Venture Capital GmbH and are not included within the scope of the MIT license granted herein. Use of these trademarks must comply with Task Venture Capital GmbH's Trademark Guidelines, and any usage must be approved in writing by Task Venture Capital GmbH.

readme.md

@push.rocks/smartocr

Install

Usage

Preparing Your Project

Basic Setup

Advanced Usage

Handling OCR Result

Error Handling

Conclusion

License and Legal Information

Trademarks

Company Information