Skip to content

LiveCaption OCR - A real-time screen text extraction tool using Tesseract OCR. Capture, recognize, and log on-screen text dynamically. Future updates will include on-demand language installation, resizable selection areas, and live text overlays.

License

Notifications You must be signed in to change notification settings

CarloSacchi/Live-Caption-OCR-Reader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

🖥️ LiveCaption OCR - Real-Time Screen Text Extraction

LiveCaption OCR is an open-source real-time screen text extraction tool that allows users to capture on-screen text dynamically using Tesseract OCR. With an easy-to-use interface, you can select an area of the screen and extract text continuously, making it ideal for live captions, subtitles, or instant text recognition.

🚀 Planned future enhancements include:

  • On-demand language installation for Tesseract OCR
  • Dynamically resizable selection area instead of a fixed rectangle
  • Improved multi-monitor support
  • Live text and Trasnlation overlay feature

📌 Features

Real-time OCR processing using Tesseract OCR
Multi-language support (English, Italian, more in the future)
Logs and saves extracted text to a file ✔ Hotkey Support: Ctrl+Q to stop capture


🛠️ Prerequisites

Before installing, ensure you have the following:

  • Python 3.x installed
  • Tesseract OCR installed (Download it here)
  • Required Python libraries

👥 Installation

1️⃣ Install Python & Dependencies

pip install pytesseract pillow keyboard pyautogui tkinter

2️⃣ Install Tesseract OCR

📉 Windows users:
Download & install Tesseract from here.
Then, update the script with the correct Tesseract path:

TESSDATA_PREFIX = r'C:\Program Files\Tesseract-OCR\tessdata'
TESSERACT_CMD = r'C:\Program Files\Tesseract-OCR\tesseract.exe'

▶️ Running the Application

After installing the dependencies, clone the repository and run the script:

git clone https://github.com/your-username/LiveCaptionOCR.git
cd LiveCaptionOCR
python Live-Caption-OCR-Reader.py

🖥️ How to Use

1️⃣ Select a language from the dropdown menu (Ita or Eng).
2️⃣ Click "Start (Select Area)" to open the selection window.
3️⃣ Drag the yellow rectangle over the area you want to capture.
4️⃣ Press OK to start real-time OCR.
5️⃣ Press Ctrl+Qor STOP to stop the capture.

LiveOCR Start Capturing

LiveOCR Main Screen

📂 Captured text is saved in a file named in the same folder as the Python file, capture_YYYY-MM-DD-HH-MM-SS.txt.

LiveOCR Text Files

LiveOCR Text Output


📊 Future Development

This project is open-source and under active development. Planned features include:

  • 📉 Dynamic area selection (resizable & movable)
  • 📉 On-demand installation of additional OCR languages
  • 📉 Better multi-monitor support
  • 📉 Live translator with DeepL or Google Translate API
  • 📉 I improvement
  • 📉 Cloud integration for storage

If you want to contribute, feel free to submit pull requests or feature requests! 🚀


📚 License

This project is licensed under the MIT License, allowing free use and modification.

👤 Author: Carlo Sacchi

About

LiveCaption OCR - A real-time screen text extraction tool using Tesseract OCR. Capture, recognize, and log on-screen text dynamically. Future updates will include on-demand language installation, resizable selection areas, and live text overlays.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages