Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation

arXiv cs.CL·Mariam Barakat, Ekaterina Kochmar

3h ago

·~1 min·5/26/2026·en·0

Quick Take

A modular pipeline enhances educational analogy generation by systematically analyzing model choice and input configuration.

Key Points

Decomposes analogy generation into four stages.
Sub-concepts improve explanation quality significantly.
Claude Sonnet 4.6 aligns better with human rankings.

Article Content

From source RSS / original summary

arXiv:2605. 24211v1 Announce Type: new Abstract: Analogies help learners understand unfamiliar concepts by relating them to known concepts. Despite recent advances, large language models (LLMs) continue to struggle to generate analogies of comparable quality to those produced by humans. We present a modular pipeline for educational analogy generation, decomposing the task into four stages: source finding, sub-concept generation, explanation generation, and evaluation.

Grounded in Structure Mapping Theory, the pipeline enables systematic, stage-by-stage analysis of how model choice and input configuration affect analogy quality. We evaluate 12 state-of-the-art LLMs across six model families on two datasets with structured sub-concept annotations (SCAR and ParallelPARC), alongside seven embedding models for closed-setting retrieval.

Our results show that sub-concepts substantially improve explanation quality and closed setting retrieval precision but provide limited benefit in open-ended source generation. We further introduce an LLM-as-a-judge evaluation methodology and validate its scoring against human annotations from seven annotators, finding that Claude Sonnet 4. 6 aligns more reliably with human rankings than with fine-grained absolute scores.

Taken together, our findings reveal cross-stage interactions that isolated studies cannot capture, and highlight sub-concept grounding as a key driver of analogy quality generation.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation

Quick Take

Key Points

Article Content

Want this in your inbox every morning?

More from arXiv cs.CL

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

Extracting Training Data from Diffusion Language Models via Infilling

Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution