A Platform for AI-Assisted Archival Metadata Generation

Kyeongmin Rim; Owen C. King; Kelley Lynch; Marc Verhagen; James Pustejovsky

doi:10.1007/978-3-031-93160-4_12

Back

Book chapter

A Platform for AI-Assisted Archival Metadata Generation

Kyeongmin Rim, Owen C. King, Kelley Lynch, Marc Verhagen and James Pustejovsky

Culture and Computing, pp.183-203

Lecture Notes in Computer Science, Springer Nature Switzerland

2025

DOI: https://doi.org/10.1007/978-3-031-93160-4_12

Abstract

Artificial intelligence for cultural heritage

Audiovisual archives

Digital Library

This paper presents our latest work on Computational Linguistics Applications for Multimedia Services (CLAMS), a open-source Artificial Intelligence (AI) and machine learning (ML) platform for various cultural institutions in the GLAM sector. CLAMS provides a framework for developing and implementing ML-based computational multimedia analysis tools, and optimizes the processing of audiovisual archival material by seamlessly integrating tools across various media types, including text, audio, video, and images. CLAMS’s primary function, automated content analysis and information extraction, provides archivists with an AI-assisted environment for metadata refinement. This will enable the cataloging of extensive audiovisual collections, which would be impossible to complete manually, thus ultimately increasing the usability of the audiovisual archives and allowing library patrons and media researchers to discover and search the archives more easily. At the core of CLAMS interoperability is the Multi-Media Interchange Format (MMIF), a structured, JSON-based data abstraction that supports a consistent data exchange layer between different computational analysis tools, including AI and ML applications. This allows annotations from one tool to be easily used by others, enabling complex automated content analysis workflows. The paper describes specifics of MMIF, the CLAMS platform and ecosystem, and case studies of CLAMS workflows and evaluation schemes using data from the American Archive of Public Broadcasting (AAPB). These use cases illustrate how CLAMS can enhance metadata for mass-digitized multimedia collections, that is often only implicitly available within the digitized media and are largely unsearchable and held in archives and libraries.

Metrics

1 Record Views

See more details

Details

Title: A Platform for AI-Assisted Archival Metadata Generation
Creators: Kyeongmin Rim
Owen C. King
Kelley Lynch
Marc Verhagen
James Pustejovsky
Contributors: Matthias Rauterberg (Editor)
Publication Details: Culture and Computing, pp.183-203
Series: Lecture Notes in Computer Science
Publisher: Springer Nature Switzerland; Cham
Identifiers: 9924481564501921
Academic Unit: Michtom School of Computer Science; Benjamin and Mae Volen National Center for Complex Systems; Interdepartmental Program in Linguistics and Computational Linguistics
Language: English
Resource Type: Book chapter

A Platform for AI-Assisted Archival Metadata Generation

Abstract

Metrics

Details

Brandeis University Social media