FiMI: A Domain-Specific Language Model for Indian Finance Ecosystem
About
We present FiMI (Finance Model for India), a domain-specialized financial language model developed by National Payments Corporation of India (NPCI) for Indian digital payment systems. We develop two model variants: FiMI Base and FiMI Instruct. FiMI adapts the Mistral Small 24B architecture through a multi-stage training pipeline, beginning with continuous pre-training on 68 Billion tokens of curated financial, multilingual (English, Hindi, Hinglish), and synthetic data. This is followed by instruction fine-tuning and domain-specific supervised fine-tuning focused on multi-turn, tool-driven conversations that model real-world workflows, such as transaction disputes and mandate lifecycle management. Evaluations reveal that FiMI Base achieves a 20\% improvement over the Mistral Small 24B Base model on finance reasoning benchmark, while FiMI Instruct outperforms the Mistral Small 24B Instruct model by 87\% on domain-specific tool-calling. Moreover, FiMI achieves these significant domain gains while maintaining comparable performance to models of similar size on general benchmarks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code Generation | HumanEval | Pass@181.1 | 850 | |
| Multi-task Language Understanding | MMLU | Accuracy77.99 | 842 | |
| Language Understanding | MMLU | Accuracy72 | 756 | |
| Question Answering | ARC Challenge | Accuracy61.86 | 749 | |
| Reasoning | BBH | Accuracy84.64 | 507 | |
| Mathematical Reasoning | GSM8K | Accuracy (GSM8K)90.37 | 358 | |
| Instruction Following | IFEval | -- | 292 | |
| Question Answering | TriviaQA | Accuracy52 | 210 | |
| Multitask Language Understanding | MMLU-Pro | Accuracy59.88 | 99 | |
| Reasoning | ARC | Accuracy56 | 83 |