Exploring the World of AI, One Step at a Time

Applied Scientist with 2.5+ years specializing in NLP and Vision-Language Models for complex document intelligence and domain-specific reasoning.

Akhil Theerthala

About Me

I am a Senior Member Data Scientist at Perfios Software Solutions with 2.5+ years of hands-on experience in NLP and Vision-Language Models for document intelligence.

I have proven technical ownership across the full ML lifecycle — translating cutting-edge research into scalable production systems. My open-source work on Kuvera personal finance LLMs has attracted 35,000+ downloads on Hugging Face.

97.5% Latency Reduction
27.6% Accuracy Improvement
Apr 2025 – Present

Senior Member Data Scientist

Perfios Software Solutions

Pioneering VLMs (PaliGemma2) with LoRA for financial data; designing agentic reasoning workflows; developing reference-free document legibility algorithms.

Jun 2023 – Apr 2025

Member Data Scientist

Perfios Software Solutions

Reduced inference latency 8s→200ms via distillation; improved table detection by 27.6%; enhanced TSR with semantic row-detection.

Aug 2019 – May 2023

B.Tech in Aerospace Engineering

IIT Kharagpur

Publications

Selected Writings

Technical articles on machine learning, reasoning systems, and applied research.

Jan 2026

Density vs. Diversity in Data Selection

Analyzing dense vs diverse sampling strategies for VLM training on 15k-sample synthetic datasets.

Read More →
Apr 2025

Creating a Reasoning Dataset with No Budget

How I ranked 1st globally in the Reasoning Dataset Creation Challenge using synthetic data.

Read More →
Feb 2025

From Training Language Models to DeepSeek-R1

An overview of how training regimes evolved from classic approaches to modern reasoning models.

Read More →
Feb 2025

7 Practical PyTorch Tips

Lessons learned from production PyTorch development covering best practices and optimization.

Read More →

Notable Projects

Recognition & Certifications

Open Source Impact

Kuvera datasets and models with 35,000+ downloads. Active contributor to Hugging Science (AI-for-Food-Allergies).

Publications

AAAI 2026 Workshop (FinForge) and FinNLP @ EMNLP 2025 on behaviour-aware personal finance LLMs.

Certifications

The Reasoning Course (HuggingFace), Generative AI with LLMs (DeepLearning.AI), ML in Production (Coursera).