Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild

AI-generated keywords: Large language models Socratic dialogue curriculum sequencing student learning outcomes explicit curriculum structure

AI-generated Key Points

  • Large language models (LLMs) lack a structured curriculum to guide effective learning process
  • Scaling up LLMs does not bridge the gap in effective tutoring
  • Proposal of a novel approach that separates responsibilities for curriculum sequencing, conducting Socratic dialogue, and inferring student knowledge
  • Construction of a prerequisite knowledge graph to guide tutoring process by determining what to teach next and how many dialogue turns to allocate
  • Implementation of lightweight Proximal Policy Optimization (PPO) policy for sequencing decisions and leveraging an LLM for Socratic exchanges at each node
  • Significant improvements in student learning outcomes across STEM and non-STEM topics achieved by PPO-paired tutor compared to heuristic baselines and specialized models
  • Importance of explicit curriculum structure in enhancing learning outcomes beyond model scalability alone, drawing from educational psychology principles.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Sidney Tio, Arunesh Sinha, Pradeep Varakantham

10 Main Body Pages, with Appendices
License: CC BY 4.0

Abstract: Large language models are now widely used for everyday learning, but the underlying interactions are typically unstructured chats rather than following a curriculum. Unlike formal online learning systems, these interactions carry no prior record of the student, so any estimate of what the student already knows must be inferred from the dialogue itself. We show that this gap is not closed by scaling models alone. Frontier and education-tuned LLMs perform poorly when asked to tutor a student over an extended session, because doing so requires three things at once. The tutor must sequence a curriculum, conduct Socratic dialogue, and infer the student's knowledge state from that dialogue. We propose separating these responsibilities. Given a student query, our system constructs a prerequisite knowledge graph in which subtopics are nodes and dependencies are edges, and frames tutoring as deciding which node to teach next and how many dialogue turns to spend on it before moving on. A lightweight PPO policy handles this sequencing decision, while an LLM conducts the Socratic exchange at the chosen node and returns a signal of student progress. Across held-out STEM and non-STEM topics, our PPO-paired tutor outperforms heuristic baselines, frontier general-purpose models, and a model specialised for Socratic dialogue: on both the rate at which students reach full curriculum mastery and the number of turns required. Explicit curriculum structure delivers gains that scaling the underlying model does not.

Submitted to arXiv on 10 Jun. 2026

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2606.11744v1

In the study "Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild," researchers Sidney Tio, Arunesh Sinha, and Pradeep Varakantham delve into the realm of large language models (LLMs) used for everyday learning. They highlight a key issue: while these models engage in unstructured chats with learners, they lack a structured curriculum to guide the learning process effectively. Unlike formal online learning systems that track student progress, LLM interactions rely solely on dialogue to infer a student's knowledge level. The researchers demonstrate that simply scaling up LLMs does not bridge this gap in effective tutoring. To address this challenge, they propose a novel approach that separates the responsibilities of curriculum sequencing, conducting Socratic dialogue, and inferring student knowledge. Their system constructs a prerequisite knowledge graph where subtopics are nodes and dependencies are edges. This framework guides the tutoring process by determining which node to teach next and how many dialogue turns to allocate before moving on. By implementing a lightweight Proximal Policy Optimization (PPO) policy for sequencing decisions and leveraging an LLM for Socratic exchanges at each node, the researchers achieve significant improvements in student learning outcomes across STEM and non-STEM topics. Their PPO-paired tutor outperforms heuristic baselines and specialized models for Socratic dialogue by enhancing both the rate at which students master the curriculum and reducing the number of turns required. Drawing from educational psychology principles emphasizing reasoning through intermediate steps, strategic content sequencing, and active student engagement under uncertainty for improved retention, this study underscores the importance of explicit curriculum structure in enhancing learning outcomes beyond model scalability alone.
Created on 03 Jul. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.