EduMap: From Static PDFs to Interactive Knowledge Graphs

An interactive mind-map generator leveraging multimodal LLMs to reduce cognitive load in academic reading

Spring 2026 CSCI 5541 NLP: Class Project - University of Minnesota

P(What's Next) = 1

Yana Jin

Nicole Vu

Farhana Anjum

Shaun Ting



The Problem & Our Solution

The Problem: Cognitive Overload in Academic Reading

Students and researchers regularly navigate dense, hierarchically structured PDFs. However, traditional PDF viewers force readers to process this complex information linearly, requiring them to mentally reconstruct the document's underlying structure. While existing AI summarizers (like NotebookLM) can condense text, they produce static outputs that strip away crucial visual and structural relationships, treating foundational concepts and highly technical arguments exactly the same.

Our Solution: Interactive Knowledge Mapping

EduMap shifts the paradigm from passive reading to active, interactive learning. By combining layout-aware parsing with advanced Large Language Models, EduMap automatically transforms long, complex academic papers into structured, editable mind maps. It serves not just as a summary, but as a personalized "study artifact" that reduces cognitive load, preserves document hierarchy, and helps users prioritize their focus.

Screenshot of the EduMap web application showing the mind map and editor

EduMap interface: PDF-derived mind map with synchronized editing.


Core Features

Dual-Synchronized View

Experience a seamless workflow with a side-by-side Markdown editing panel and an interactive React Flow mind map. Powered by a centralized state management system, any edit you make in the text or on the canvas stays perfectly in sync in real time.

Multimodal Ingestion

Academic papers are more than just text. EduMap's backend precisely extracts figures, complex formulas, and tables, flattening them into a comprehensive multimodal context. This ensures critical visual data is deeply integrated into your generated mind map.

Human-AI Collaboration & Trust

We designed EduMap to be a scaffolding for your thoughts, not a replacement. A unique pencil badge trust mechanism clearly distinguishes your manual edits from the original AI-generated baseline. Combined with a robust undo system and comprehensive export options, you retain complete control over your knowledge graph.


Technical Architecture

Multimodal Extraction Pipeline

To overcome the limitations of standard text-only extractors, our backend utilizes a Python-based, layout-aware parsing program. It intelligently categorizes content into body text, figures, tables, and formulas. For complex, vector-drawn figures that traditional methods miss, we engineered a caption-anchored region rendering technique. Images are then sent alongside their surrounding academic text to a Vision model (e.g., GPT-4o), ensuring descriptions are highly accurate and contextually grounded.

Two-Stage Harvest & Organize LLM Orchestration

To prevent the AI from simply copying physical section headers or experiencing structural hallucinations, EduMap uses a decoupled, two-step extraction strategy:

Resilient Engineering & State Synchronization

Behind the scenes, EduMap utilizes a custom two-way conversion logic between Markdown and the graph node structure, ensuring zero data loss or hierarchy disruption during user edits. To guarantee high availability, the infrastructure includes an automatic model failover mechanism that seamlessly switches between Gemini, OpenAI, and Claude APIs if network issues arise.


Evaluation & Results

Semantic vs. Structural Accuracy

Automated testing showed that while all systems achieved similar BERTScore F1 (approx. 0.79), EduMap achieved a significantly higher Cosine Similarity (0.395) compared to NotebookLM (0.211), indicating superior structural alignment with original documents.

BERTScore F1 vs Cosine Similarity by System

BERTScore F1 vs Cosine Similarity by System.

Educational Quality

Across five diverse academic papers (AlexNet, Attention, LLaMA 3, Philosophy, and Quantum), EduMap maintained a high Average Judge Score (avg 4.45), closely competing with NotebookLM (avg 4.70) while offering significantly more structural utility than TF-IDF (avg 1.15).

Average Educational Quality Score by Paper and System

Average Educational Quality Score by Paper and System.

Average Judge Score across systems

Average Judge Score across systems.


Conclusions and Acknowledgments

EduMap demonstrates that combining multimodal extraction with a two-stage LLM orchestration strategy can transform dense academic PDFs into interactive, hierarchical mind maps that go beyond what linear summarization tools offer. Across five structurally diverse papers, EduMap achieved comparable semantic preservation to NotebookLM (BERTScore F1 of 0.796 vs. 0.790) while producing an editable, synchronized graph format rather than flat text. The system's weakest point, hierarchical organization on argumentative, non-hierarchical documents like philosophy papers, points to a genuine limitation of the mind map format itself, not just the pipeline. A core engineering contribution is the paired markdown-and-graph editing system with an exportable original baseline, giving users a clear boundary between their own thinking and what the AI generated. Future directions include citation-aware node linking, confidence signaling for uncertain extractions, and collaborative real-time editing. We thank Professor Andy Exley and Daanish Hindustani for their guidance throughout this project, and our classmates for their constructive feedback during the idea pitch and poster presentation.