Experienced AI/ML Engineer - Document Intelligence
Job Description
Experienced AI/ML Engineer - Document Intelligence needed for a US-based, cutting-edge software agency (remote position).
We're seeking an experienced AI/ML Engineer to join our team and drive innovation in document intelligence solutions.
You'll be at the forefront of developing and deploying cutting-edge machine learning systems that extract, interpret, and process information from diverse document types at scale.
Key Responsibilities
- Design, develop, and deploy production-grade ML models for document understanding and information extraction.
- Build and optimize OCR pipelines for various document formats (structured, semi-structured, and unstructured).
- Develop NLP/NLU solutions for document classification, entity extraction, and semantic understanding.
- Collaborate with cross-functional teams to integrate ML solutions into production systems.
- Evaluate and improve model performance, accuracy, and processing efficiency.
- Stay current with the latest advances in document AI, computer vision, and LLM technologies.
Required Qualifications
- 5+ years of professional experience in Machine Learning/AI engineering roles.
- Deep expertise in document intelligence, including:
  - OCR technologies (Tesseract, AWS Textract, Google Document AI, Azure Form Recognizer, or similar).
  - Document layout analysis and table extraction.
  - Handwriting recognition and form processing.
- Strong programming skills in Python and ML frameworks (TensorFlow, PyTorch, scikit-learn).
- Experience with computer vision techniques and libraries (OpenCV, PIL, etc.).
- Proven track record of deploying ML models to production environments.
- Strong understanding of ML fundamentals, model evaluation, and optimization techniques.
Preferred Qualifications
- Experience with Large Language Models (LLMs) for document understanding.
- Knowledge of modern document AI architectures (LayoutLM, Donut, DocFormer, etc.).
- Experience with cloud platforms (AWS, GCP, Azure) and their ML services.
- Familiarity with MLOps practices and tools (MLflow, Kubeflow, etc.).
- Background in NLP/NLU and transformer-based models.
- Experience with data annotation workflows and active learning.
- Publications or contributions to open-source ML projects.
Technical Skills
- Languages: Python (required); experience with other languages is a plus.
- ML/DL Frameworks: TensorFlow, PyTorch, Keras, scikit-learn.
- OCR & Document Processing: Extensive hands-on experience required.
- Cloud & Infrastructure: Docker, Kubernetes, CI/CD pipelines.
- Data Processing: Pandas, NumPy, data preprocessing and augmentation.
Email: [contactus@honovix.ai](mailto:contactus@honovix.ai)
Website: https://honovix.ai/
WhatsApp: +13108498233