Abstract:
Paramanu-Ganita is a novel Auto Regressive (AR) decoder based language model developed for specialized mathematical tasks. Despite being significantly smaller than its mainstream counterparts, this model demonstrates excellent performance, outperforming even larger, more established LLMs.
Importance:
Paramanu-Ganita redefines the scalability and efficacy anticipated from high-performing LLMs, particularly in domain-specific applications like mathematics. Its success points to a shifting paradigm where smaller, more tailored models may outperform generalized giant models not only in effectiveness but also in efficiency.