Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Ai2 Scholar QA: Organized Literature Synthesis with Attribution

About

Retrieval-augmented generation is increasingly effective in answering scientific questions from literature, but many state-of-the-art systems are expensive and closed-source. We introduce Ai2 Scholar QA, a free online scientific question answering application. To facilitate research, we make our entire pipeline public: as a customizable open-source Python package and interactive web app, along with paper indexes accessible through public APIs and downloadable datasets. We describe our system in detail and present experiments analyzing its key design decisions. In an evaluation on a recent scientific QA benchmark, we find that Ai2 Scholar QA outperforms competing systems.

Amanpreet Singh, Joseph Chee Chang, Chloe Anastasiades, Dany Haddad, Aakanksha Naik, Amber Tanaka, Angele Zamarron, Cecile Nguyen, Jena D. Hwang, Jason Dunkleberger, Matt Latzke, Smita Rao, Jaron Lochner, Rob Evans, Rodney Kinney, Daniel S. Weld, Doug Downey, Sergey Feldman• 2025

Related benchmarks

TaskDatasetResultRank
Long-form researchDRB
Score36.1
39
Deep ResearchResearchQA
Score75
21
Deep ResearchSQA v2
Score87.7
18
Long-form researchResearchQA
Score75
18
Deep ResearchHealthBench
Score32
17
Long-form researchRESEARCHRUBRICS
Score38.1
16
Long-form researchHealthBench
Overall Score32
14
Deep Research report generation and action adherenceMYSQA dataset
Analysis Coverage88.9
6
Showing 8 of 8 rows

Other info

Follow for update