OCR-ReportLab

Note

OCR-ReportLab is a collection of Colab notebooks designed to perform Optical Character Recognition (OCR) on images and generate DOCX or PDF documents containing both the original image and the extracted text. It supports multiple state-of-the-art vision-language models for experimentation and practical use.

Notebooks

You can launch and run the following notebooks directly in Google Colab:

Nanonets OCR: Open in Colab
Monkey OCR: Open in Colab
OCRFlux 3B: Open in Colab
Typhoon OCR: Open in Colab

Features

Extracts text from input images using various OCR models
Embeds the image and extracted text into DOCX or PDF formats
Designed for quick deployment via Google Colab

Supported Models

The repository currently supports the following OCR implementations:

Nanonets OCR
Monkey OCR
OCRFlux 3B
Typhoon OCR 3B

Installation

No installation is required. Simply click on the links above to run the notebooks in Google Colab. Make sure to upload your image file(s) when prompted and follow the instructions in the notebook.

Other Images

OCR

Caption

Dependencies

The notebooks are built using:

Python
PyTorch
Hugging Face Transformers
ReportLab
Gradio (for UI)
(Qwen2.5-VL based)

All dependencies are automatically installed in the Colab environment.

Author

Created and maintained by PRITHIVSAKTHIUR

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Megalodon-OCR-Sync-0713-ColabNotebook		Megalodon-OCR-Sync-0713-ColabNotebook
MonkeyOCR-0709		MonkeyOCR-0709
OCRFlux3B		OCRFlux3B
monkey-OCR		monkey-OCR
nanonets-OCR		nanonets-OCR
typhoon-OCR		typhoon-OCR
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OCR-ReportLab

Notebooks

Features

Supported Models

Installation

Other Images

Dependencies

Author

About

Uh oh!

Languages

License

PRITHIVSAKTHIUR/OCR-ReportLab

Folders and files

Latest commit

History

Repository files navigation

OCR-ReportLab

Notebooks

Features

Supported Models

Installation

Other Images

Dependencies

Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages