Developed by XI TJKT 2 | 2024/2025 | Any commercial use or unauthorized exploitation is prohibited
ILTE - Indigenous Language Translator Engine
The Indigenous Language Translator Engine (ILTE) now offers four distinct versions, each tailored to different translation needs:
- ILTE-ALT (Optimized for Speed): A lightweight, dictionary-based translator for fast, low-resource translations.
- ILTE-ZS (Hybrid, Multi-Processing): Combines dictionary-based rules, rule-based machine translation (RBMT), finite-state transducers (FST), semantic matching, and zero-shot translation while efficiently handling large text files.
- ILTE-ADV (AI-Powered, Context-Aware): An advanced, AI-driven translation engine that integrates context awareness, semantic similarity, and zero-shot learning.
- ILTE-ATI (Advanced Attention & Iterative Processing): The most sophisticated version, with hierarchical normalization, iterative refinement, attention-based translation, and multi-level candidate selection.

ILTE-ALT features:

- Dictionary-Based Lookup for direct translations.
- Basic Stemming for Indonesian (ID) & English (EN).
- Levenshtein Distance Matching for closest word lookup.
- Automated Confidence Scoring for accuracy estimation.
- Structured DOCX Report Generation.
- Low Memory Usage: optimized for lower-end machines.
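
The dictionary lookup, Levenshtein matching, and confidence scoring listed for ILTE-ALT can be sketched roughly as follows. This is a minimal illustration, not the code in `engine_ALT.py`: the function names, the `max_distance` cutoff, the confidence formula, and the toy Indonesian-to-English dictionary are all assumptions made for the example.

```python
from typing import Dict, Tuple


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance between two strings."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution
            ))
        previous = current
    return previous[-1]


def translate_word(word: str, dictionary: Dict[str, str],
                   max_distance: int = 2) -> Tuple[str, float]:
    """Return (translation, confidence) for a single word.

    Exact hits get confidence 1.0. Otherwise the closest dictionary key
    within max_distance is used, with confidence reduced in proportion to
    the edit distance. Unknown words come back unchanged with 0.0.
    """
    key = word.lower()
    if key in dictionary:
        return dictionary[key], 1.0
    best_key, best_dist = None, max_distance + 1
    for candidate in dictionary:
        dist = levenshtein(key, candidate)
        if dist < best_dist:
            best_key, best_dist = candidate, dist
    if best_key is None:
        return word, 0.0
    confidence = 1.0 - best_dist / max(len(key), len(best_key))
    return dictionary[best_key], confidence


# Hypothetical toy dictionary, for illustration only.
toy_dictionary = {"rumah": "house", "makan": "eat", "air": "water"}
print(translate_word("rumah", toy_dictionary))   # exact match
print(translate_word("rumaj", toy_dictionary))   # one typo away from "rumah"
print(translate_word("xyzxyz", toy_dictionary))  # nothing close enough
```

Scanning every key is linear in the dictionary size, which is acceptable for small word lists; a larger dictionary would probably want an index or a length-based prefilter.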

ILTE-ZS features:

- Dictionary + RBMT + FST + Semantic Matching + Zero-Shot Translation.
- Handles Large Files Efficiently via chunking & batch multi-processing.
- Optimized Resource Management: frees memory and GPU resources after processing.
- Auto-Parallelized Translation Pipeline.
- Faster Preprocessing with no unnecessary computation.
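
The chunking and batch processing described for ILTE-ZS might look something like the sketch below. It is an assumption-laden illustration rather than the engine's actual pipeline: `split_into_chunks`, `translate_large_text`, the `max_chars` limit, and the placeholder `shout` translator are hypothetical names. A thread pool is the default here for portability; passing `ProcessPoolExecutor` as `executor_cls` gives true multi-processing for CPU-bound translators.

```python
from concurrent.futures import Executor, ThreadPoolExecutor
from typing import Callable, List, Type


def split_into_chunks(text: str, max_chars: int = 200) -> List[str]:
    """Group whole lines into chunks of at most max_chars characters.

    A single line longer than max_chars becomes its own chunk.
    """
    chunks: List[str] = []
    current: List[str] = []
    size = 0
    for line in text.splitlines():
        extra = len(line) + (1 if current else 0)  # +1 for the joining newline
        if current and size + extra > max_chars:
            chunks.append("\n".join(current))
            current, size = [], 0
            extra = len(line)
        current.append(line)
        size += extra
    if current:
        chunks.append("\n".join(current))
    return chunks


def translate_large_text(
    text: str,
    translate_chunk: Callable[[str], str],
    max_chars: int = 200,
    max_workers: int = 4,
    executor_cls: Type[Executor] = ThreadPoolExecutor,
) -> str:
    """Translate text chunk by chunk on a worker pool and rejoin the results."""
    chunks = split_into_chunks(text, max_chars)
    with executor_cls(max_workers=max_workers) as pool:
        # map() yields results in input order, so output lines up with the source.
        translated = list(pool.map(translate_chunk, chunks))
    return "\n".join(translated)


def shout(chunk: str) -> str:
    """Placeholder for a real per-chunk translator."""
    return chunk.upper()


sample = "\n".join(f"baris {i}" for i in range(10))
print(translate_large_text(sample, shout, max_chars=20))
```

Chunking on line boundaries keeps sentences intact and bounds the memory needed per worker, at the cost of losing context that spans two chunks.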

ILTE-ADV features:

- Contextual Translation using IndoBERT & Sentence Transformers.
- Zero-Shot Learning for Handling Unknown Words.
- Pattern-Based Learning & Semantic Matching.
- Enhanced Translation Confidence Metrics.
- Leverages GPU Acceleration for Faster Processing.
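
Semantic matching of the kind listed for ILTE-ADV usually comes down to comparing embedding vectors by cosine similarity. The sketch below shows only that comparison step. The three-dimensional vectors are hand-written stand-ins for real sentence embeddings, and `best_semantic_match` and its `threshold` are hypothetical names chosen for the example, not part of the engine.

```python
import math
from typing import Dict, Optional, Sequence, Tuple


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def best_semantic_match(
    query_vec: Sequence[float],
    candidates: Dict[str, Sequence[float]],
    threshold: float = 0.7,
) -> Tuple[Optional[str], float]:
    """Return the candidate most similar to the query and its score.

    The label is None when no candidate reaches the threshold.
    """
    best_label, best_score = None, 0.0
    for label, vec in candidates.items():
        score = cosine_similarity(query_vec, vec)
        if score > best_score:
            best_label, best_score = label, score
    if best_score < threshold:
        return None, best_score
    return best_label, best_score


# Toy vectors standing in for model-produced embeddings (assumption).
candidate_vectors = {
    "house": [0.9, 0.1, 0.0],
    "water": [0.0, 0.2, 0.9],
}
print(best_semantic_match([0.8, 0.2, 0.1], candidate_vectors))
print(best_semantic_match([0.0, 0.0, 0.0], candidate_vectors))
```

The threshold keeps a weak nearest neighbour from being reported as a confident match, which is where a fallback such as zero-shot translation could take over.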

ILTE-ATI features:

- Hierarchical Normalization for Better Preprocessing.
- Iterative Translation for Context Awareness.
- Attention-Based Translation for Multi-Level Candidate Generation.
- Refined Confidence Scoring & Adaptive Refinement.
- Full Formatting Preservation in DOCX Reports.
- Optimized for Dynamic, Multi-Stage Translation Processes.

Models used by ILTE-ALT:

Feature | Model Used |
---|---|
Translation (ID-EN, EN-ID) | Helsinki-NLP/opus-mt-id-en, Helsinki-NLP/opus-mt-en-id |
Stemming | Sastrawi (Indonesian), SnowballStemmer (English) |
Fuzzy Matching | Levenshtein Distance |

Models used by ILTE-ZS:

Feature | Model Used |
---|---|
Dictionary-Based Lookup | JSON-based dictionary |
Rule-Based Translation (RBMT, FST) | Custom FST Rules |
Semantic Similarity | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 |
Zero-Shot Translation | facebook/mbart-large-50-many-to-many-mmt |

Models used by ILTE-ADV:

Feature | Model Used |
---|---|
Contextual Embeddings | cahya/bert-base-indonesian-1.5G |
Semantic Matching | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 |
Zero-Shot Classification | typeform/distilbert-base-uncased-mnli |
Translation (ID-EN, EN-ID) | Helsinki-NLP/opus-mt-id-en, Helsinki-NLP/opus-mt-en-id |

Models used by ILTE-ATI:

Feature | Model Used |
---|---|
Hierarchical Normalization | Regex + Dynamic Stemming |
Contextual Translation | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 |
Iterative Processing | Multi-Level Candidate Refinement |
Translation (ID-DYK, DYK-ID) | Enhanced Dictionary Lookup + Semantic Matching |
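
The table describes hierarchical normalization as regex plus stemming. One way to read that is as a stack of passes, each cleaning up what the previous one left behind, as in the sketch below. This is a guess at the general shape, not the logic in `engine_ATI.py`: the three levels, the function names, and the short suffix list are assumptions, and the suffix stripping is a crude stand-in for a real stemmer such as Sastrawi.

```python
import re
import unicodedata
from typing import List

# Hypothetical suffix list for the sketch; a real engine would use a proper stemmer.
SUFFIXES = ("nya", "kan", "lah")


def normalize_surface(text: str) -> str:
    """Levels 1 and 2: Unicode and case folding, then regex cleanup."""
    text = unicodedata.normalize("NFKC", text).lower()
    text = re.sub(r"[^\w\s'-]", " ", text)    # drop punctuation except ' and -
    return re.sub(r"\s+", " ", text).strip()  # collapse runs of whitespace


def strip_suffix(token: str, min_stem: int = 4) -> str:
    """Level 3: remove one known suffix if a long enough stem remains."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= min_stem:
            return token[: -len(suffix)]
    return token


def normalize(text: str) -> List[str]:
    """Run all levels in order and return the normalized tokens."""
    return [strip_suffix(token) for token in normalize_surface(text).split()]


print(normalize("  Rumahnya   BESAR, sekali!  "))
```

Keeping the levels as separate functions makes it possible to stop early, for example to look up the surface form in the dictionary before falling back to the stem.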

Version comparison:

Feature | ILTE-ALT | ILTE-ZS | ILTE-ADV | ILTE-ATI |
---|---|---|---|---|
Translation Approach | Dictionary | Hybrid | AI-Based | Attention-Based + Iterative |
Processing Speed | Fast | Moderate | Slower | Balanced |
Handling Large Files | Struggles | Efficient Chunking | Slower | Optimized Processing |
Memory Usage | Low | Moderate | High | Optimized |
Context Awareness | None | Partial | Strong | Very Strong |
Idiomatic Expressions | Limited | Rule-Based | AI-Based | AI + Attention |
Parallelization | Minimal | Yes | DataLoader | Thread + Process Pool |
Zero-Shot Capability | No | Yes | Yes | Yes |
Best Use Case | Fast translation | Large text processing | Context-Aware | High-Accuracy, AI-Powered |

Run the script for the version you want:

- ILTE-ALT: python engine_ALT.py
- ILTE-ZS: python engine_ZS.py
- ILTE-ADV: python engine_ADV.py
- ILTE-ATI: python engine_ATI.py

Choose the version that best suits your needs and contribute to indigenous language preservation.
- ALT: For lightweight, dictionary-based translations.
- ZS: For handling large files efficiently with hybrid translation techniques.
- ADV: For AI-powered, context-aware translations.
- ATI: For attention-based, iterative translations with the strongest context awareness.