archaeological-informatization-poc

Snapshot of the proof-of-concept pipeline from KIM Hongyeon’s 2025 paper on LLM-based metadata extraction from archaeological excavation reports. The pipeline covers PDF text and image extraction, caption mapping, LLM structuring, SQLite storage, and glossary-augmented Q&A.
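
To make the last two stages concrete, here is a minimal sketch of how structured metadata could be stored in SQLite and then combined with glossary entries into a Q&A prompt. It is not taken from the paper or this repository: the table name `report_metadata`, its columns, the helpers `store_record` and `build_prompt`, and the sample record and glossary are all hypothetical. It uses only the Python standard library and stops short of calling any LLM.

```python
import json
import sqlite3

# Hypothetical output of the LLM structuring step for one figure caption.
record = {
    "report_id": "report-001",
    "caption": "Figure 3. Plan of pit dwelling no. 2",
    "metadata": {"feature_type": "pit dwelling"},
}

# Hypothetical glossary of domain terms (placeholder definition).
GLOSSARY = {"pit dwelling": "A house whose floor is dug below ground level."}


def store_record(conn: sqlite3.Connection, rec: dict) -> None:
    """Create the (assumed) table if needed and insert one record."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS report_metadata ("
        "report_id TEXT, caption TEXT, metadata_json TEXT)"
    )
    conn.execute(
        "INSERT INTO report_metadata VALUES (?, ?, ?)",
        (
            rec["report_id"],
            rec["caption"],
            json.dumps(rec["metadata"], ensure_ascii=False),
        ),
    )
    conn.commit()


def build_prompt(conn: sqlite3.Connection, question: str, glossary: dict) -> str:
    """Assemble a prompt from matching glossary terms and all stored records."""
    q = question.lower()
    # Keep only glossary entries whose term appears in the question.
    terms = [f"- {t}: {d}" for t, d in glossary.items() if t.lower() in q]
    rows = conn.execute(
        "SELECT report_id, caption, metadata_json FROM report_metadata"
    ).fetchall()
    context = [f"- [{rid}] {cap} {meta}" for rid, cap, meta in rows]
    return "\n".join(
        ["Glossary:", *terms, "Records:", *context, f"Question: {question}"]
    )


conn = sqlite3.connect(":memory:")  # in-memory database for the example
store_record(conn, record)
prompt = build_prompt(conn, "Which reports describe a pit dwelling?", GLOSSARY)
print(prompt)
```

The resulting `prompt` string would then be passed to whichever LLM client the pipeline uses. Storing the metadata as a JSON string keeps the sketch schema-agnostic; a real schema would likely use dedicated columns.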