Ad 728 × 90

Breaking News

random

Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual) https://ift.tt/pcGH7IK

Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual) Hi HN, I’ve been working on an OCR pipeline specifically optimized for machine learning dataset preparation. It’s designed to process complex academic materials — including math formulas, tables, figures, and multilingual text — and output clean, structured formats like JSON and Markdown. Some features: • Multi-stage OCR combining DocLayout-YOLO, Google Vision, MathPix, and Gemini Pro Vision • Extracts and understands diagrams, tables, LaTeX-style math, and multilingual text (Japanese/Korean/English) • Highly tuned for ML training pipelines, including dataset generation and preprocessing for RAG or fine-tuning tasks Sample outputs and real exam-based examples are included (EJU Biology, UTokyo Math, etc.) Would love to hear any feedback or ideas for improvement. GitHub: https://ift.tt/hafADdz https://ift.tt/hafADdz April 2, 2025 at 10:48PM
Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual) https://ift.tt/pcGH7IK Reviewed by Technology World News on April 03, 2025 Rating: 5

No comments:

Contact Form

Name

Email *

Message *

Powered by Blogger.