Overview
“Smarter Extraction of ScholArly MEtadata using Knowledge Graphs and Language Models” and abbreviated as “SESAME”. The mission statement of SESAME is to bring together researchers and practitioners to explore how AI-driven curation approaches leveraging large language models and knowledge graphs to strengthen digital libraries infrastructures. The proposed workshop is intended for a broader spectrum of participants within the JCDL community, including researchers, data curators, and policy makers. It is particularly relevant to those working in digital library infrastructures, metadata curation, knowledge graph construction, information extraction, and natural language processing. Hence, participants from research backgrounds fields such as scientometrics, open science, and AI ethics will also find value, as the workshop addresses cross-cutting issues of data interoperability and transparency. The workshop aim to bring scientific community at platform encompssing of digital libraries, metadata workflows, large language models and knowledge graph. The workshop will combine foundational discussions with advanced perspectives, making it accessible to researchers across the discpline. The planned sessions keynotes talks, and collaborative activities will further ensure that participants of diverse backgrounds can contribute meaningfully to discussions and prospective conclusions. Emphasize the bridge between LLMs and linked data / KGs for high-quality scholarly metadata: Author Disambiguation, Affiiation normalization, citation context understanding, and evaluation.
Topics of Interest
- Research Artifacts Metadata Modeling and Granularity
- Metadata of scholarly publications, datasets, software, and models
- Metadata quality assessment, enrichment, and curation
- Research artifacts provenance across digital libraries
- Cross-disciplinary metadata interoperability
- Large Language Models (LLMs) and NLP for Metadata
- Research artifacts metadata extraction using LLMs
- Prompt engineering, fine-tuning for scholarly information extraction
- Evaluation, reliability and issues for LLM-generated metadata
- Comparative studies of LLM-based vs traditional methods
- LLMs for metadata curation and normalization
- AI-driven curation, preservation at scale, and long-term accessibility
- Knowledge Graphs and Linked Data
- Construction of scholarly knowledge graphs from heterogeneous metadata
- Linking and aligning entities across repositories and infrastructures
- Applications of KGs for discovery, recommendation, and impact
- Digital Libraries and Infrastructure
- Integration of metadata workflows into digital library systems
- Benchmarks, datasets, and shared tasks for metadata extraction and modeling
- System design for metadata-intensive digital library applications
- Societal, Ethical Impact and Future Policy Directions
- Ethical implications of AI-driven metadata generation and curation
- Metadata for open science, reproducibility, and research integrity
- Societal impacts of metadata granularity on scholarly evaluation and equity
- Policy frameworks and governance for interoperable metadata infrastructures
Call for Papers
The workshop invites original research on the topics above. The workshop call focuses on will call for papers in three categories mentioned below. Each submission will be reviewed by domain experts according to the JCDL guidelines.
- Long Papers: 6–8 pages (Excluding References)
- Short Papers: 2–4 pages (Excluding References)
- Demo Papers: 2–4 pages (Excluding References)
Accepted papers will be published at CEUR-WS proceedings.
Important Dates (AoE)
- Paper submission: YYYY-MM-DD
- Notification: YYYY-MM-DD
- Camera-ready: YYYY-MM-DD
- Workshop: YYYY-MM-DD
Submission
- Site: EasyChair
- Format: ACM/IEEE template (link)
- Length: as listed in CFP
- Anonymization: single/double-blind (state policy and self-citation rules)
- Supplementary: data, code, and preprints encouraged
Program
Time | Session |
---|---|
09:00–09:15 | Opening & Welcome |
09:15–10:15 | Keynote |
10:15–10:45 | Coffee Break |
10:45–12:15 | Paper Session 1 |
12:15–13:30 | Lunch |
13:30–15:00 | Paper Session 2 |
15:00–15:30 | Break |
15:30–16:30 | Panel / Discussion |
16:30–17:00 | Closing |
Keynote(s)
Prof. Dr. Silvio Peroni Director at Open Citations, University of Bologna, Italy
TBA
Organizers
- Dr. Muhammad Asif Suryani, Knowledge Technologies for the Social Sciences (KTS), Leibniz-Institut fur Sozialwissenschaften (GESIS), Köln, Germany
- Dr. Brigitte Mathiak, Knowledge Technologies for the Social Sciences (KTS), Leibniz-Institut fur Sozialwissenschaften (GESIS), Köln, Germany
- Dr. Florian Reitz, Schloss Dagstuhl Leibniz-Zentrum für Informatik Wadern, Germany
- Dr. Florian Jäckel, Schloss Dagstuhl Leibniz-Zentrum für Informatik, Wadern, Germany
- Prof. Dr. Ansgar Scherp, Data Science and Big Data Analytics, Ulm University (UULM) Ulm, Germany
Program Committee
TBA
Registration
Use the JCDL 2025 registration system (link TBA).
Venue & Travel
Co-located with JCDL 2025 (venue TBA).
Code of Conduct
We follow the conference Code of Conduct. Contact the organizers with concerns.
Contact
Questions? Email contact@domain or open an issue in this repository.
© 2025 SESAME Organizers •
contact •
GitHub Repo
Site setup and layout assistance by ChatGPT (GPT-5 Thinking).