Students were given pseudonym identifiers which linked the data included in the final dataset.
The EdLight Research Portal is a secure, researcher-facing platform that provides access to up to 300,000 annotated student math work samples, metadata, and AI model-generated insights. Each image is tagged with misconceptions, content standards, and instructional context. The portal supports query filtering, comparison by subgroups, and visualization of learning patterns. Researchers can download de-identified data in analysis-ready formats to study historical trends and AI performance.
We support data extensions through custom annotation layers. Researchers can apply their own tagging schemas to existing work samples or request the ingestion of de-identified external performance data to enrich existing records for correlational studies.
How do teacher-generated tags and AI-predicted misconceptions align across different student populations?
How do patterns of misconception vary across classrooms using the same curriculum?
What are the most common historical misconceptions in 6-8th grade math based on up to 300k+ samples?
1. Submission of research proposal and IRB approval.
2. Data use agreement execution.
3. Secure portal account creation.
4. Access to historical datasets for download and analysis
External researchers are supported through structured onboarding, Q&A sessions, and data office hours. All image data is de-identified prior to access. Use cases with minors include IRB and district approvals where required. Role-based access ensures only authorized researchers can view or export subsets.