Screenshot2Code (S2C) - OCR for Code Screenshots!

In a nutshell: Use Screenshot2Code to convert code screenshots to text while preserving its format and steering clear of clipboard limitations!

Usage: python3 screenshot2code.py <screenshot_filepath> <output_filepath>
Input: screenshot image (.jpg, .png)
Output: code pasted to output_filepath (can optionally copy to clipboard)

Python package link: https://pypi.org/project/screenshot2code/0.0.1/

Why is Screenshot2Code needed? Check out this Medium article.

Screenshot2Code vs. Existing Methods

Existing frameworks, libraries, and APIs tend to fall into one of two categories:
(1) proprietary and closed-source, or
(2) open-source but designed for images and not particularly code.
Generative AI - why not just ask AGI to do it?
- GPT-4 can handle image modalities, but it is not accessible to everyone and it is not specialized for this purpose
- ChatGPT does not accept image input.
- Google Bard as of its initial release cannot handle image modalities.
- Github CoPilot cannot handle image modalities.
- Codex cannot handle image modalities.

To the best of our knowledge, there are currently no open-source repositories or APIs that are specifically designed for converting a screenshot into code in a way that preserves syntactical information such as spacing, indentation, and newlines. While there are many OCR and ML tools that can recognize text and generate text from images such as OpenCV or Pytesseract, these tools are not specifically designed for code recognition and may not be able to preserve formatting and syntax information to the extent required for complex code.

Screenshot Formats

Screenshots almost always come in one of two formats depending on the OS.

macOS, Ubuntu, and other Linux-based OS's: .png
Windows: .jpg, .jpeg

Languages we Support

We support all 54 languages included in the Guesslang Python package, including Python, C, Shell, etc.

TODO

Fix the remainder of indentation issues (current post-processing method isn't comprehensive)
Detect whether the screenshot is code or just text
Train custom deep learning model for better and more customizable of OCR
Eliminate line numbers and other irrelevant information
Copy submitted code to clipboard

Troubleshooting

If it says that pip is not found when sourcing the virtualenv then consider python -m ensurepip --upgrade.

Authorship

This project is led by Seth Harding and Matthieu Desir.
For more information or to find out how to contribute to Screenshot2Code, please send an email! [email protected] or [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
bin		bin
screenshots		screenshots
tess_data_bak		tess_data_bak
.env.local		.env.local
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
clipboard.png		clipboard.png
deptree.txt		deptree.txt
lib64		lib64
output		output
python_input.png		python_input.png
python_output.png		python_output.png
pyvenv.cfg		pyvenv.cfg
requirements.txt		requirements.txt
s2c_output.png		s2c_output.png
screenshot2code.py		screenshot2code.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Screenshot2Code (S2C) - OCR for Code Screenshots!

Screenshot2Code vs. Existing Methods

Screenshot Formats

Languages we Support

TODO

Troubleshooting

Authorship

About

Releases

Packages

Contributors 2

Languages

License

austin-hua/Screenshot2Code

Folders and files

Latest commit

History

Repository files navigation

Screenshot2Code (S2C) - OCR for Code Screenshots!

Screenshot2Code vs. Existing Methods

Screenshot Formats

Languages we Support

TODO

Troubleshooting

Authorship

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages