View DaltonAlves's full-sized avatar

Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

Pascal 317 21 Updated Jan 26, 2025

ocr-docker is a small, Flask-powered web app that extracts text from images and PDF documents using OCR.

CSS 51 14 Updated Jan 27, 2025

Vrecord is open-source software for capturing a video signal and turning it into a digital file.

Shell 162 46 Updated Feb 17, 2025

Rails application supporting the creation of OCR and the IIIF Content Search API

Ruby 34 6 Updated Dec 14, 2022

A client library for working with the ArchivesSpace API

Python 81 13 Updated Feb 21, 2025

This Guidance demonstrates how to validate checksums for compliance and audit requirements with an on-demand fixity check process.

JavaScript 14 3 Updated Oct 20, 2024

A React audio player & transcription viewer.

JavaScript 82 17 Updated Nov 26, 2023

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 19,139 1,272 Updated Feb 27, 2025

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 8,871 1,200 Updated Feb 24, 2025

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,696 83 Updated Feb 1, 2025

the subtitle editor :)

C# 9,625 958 Updated Feb 28, 2025

OCFL tools in Python

Python 21 7 Updated Jan 30, 2025

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python 1,492 169 Updated Feb 28, 2025

Identify, review, and remove sensitive files

Python 29 2 Updated Mar 5, 2023

Working with hOCR in JavaScript

HTML 125 18 Updated Mar 4, 2023

View HOCR files with Mirador

Python 26 4 Updated Sep 27, 2017

Powerful Ruby date parser

Ruby 34 6 Updated Mar 6, 2023

A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats

Python 47 3 Updated Jul 22, 2024

Serverless replay of web archives directly in the browser

TypeScript 759 65 Updated Feb 27, 2025

Tesseract documentation

HTML 1,952 376 Updated Feb 5, 2025

Download an entire website from the Wayback Machine.

Ruby 5,482 724 Updated Feb 8, 2024

A feature-rich command-line audio/video downloader

Python 102,482 8,035 Updated Feb 28, 2025

A Rails engine supporting discovery of archival material

Ruby 40 27 Updated Jan 21, 2025

Automated date parsing plugin for ArchivesSpace

Ruby 13 6 Updated Dec 8, 2023

brozzler - distributed browser-based web crawler

Python 686 100 Updated Feb 25, 2025

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Java 2,916 762 Updated Feb 14, 2025

This repository shares NARA-created open source software to support federal agencies in their preparation of metadata and permanent electronic records for transfer to NARA.

11 Updated Aug 15, 2023

Uploads and downloads file inventories to and from ArchivesSpace

Python 6 1 Updated Apr 24, 2024

This is the general workflow to make archival information packages (AIPs) that are ready for ingest into the UGA Libraries' digital preservation system (ARCHive). The workflow organizes files, extr…

Python 4 Updated Oct 9, 2024

A search interface and wayback machine for the UKWA Solr based warc-indexer framework.

Java 109 21 Updated Jan 16, 2025