Intelligent Semantic Extraction and Transformation Pipeline for Large-Scale Multimedia Data Processing
Publication Date
9-24-2025
Document Type
Conference Proceeding
Publication Title
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting Bmsb
DOI
10.1109/BMSB65076.2025.11165552
Abstract
This paper presents a comprehensive framework for integrating semantic technologies within the ETL (Extraction, Transformation, Loading) process to improve multimedia metadata extraction and enrichment. The proposed architectural design combines traditional ETL pipelines with advanced semantic techniques, allowing for precise extraction and transformation of metadata using ontology mappings. At the core of this framework are algorithms designed to extract metadata from multimedia sources, transform it based on ontological structures, and further enrich it by adding contextualized semantic information. The architecture is evaluated through a performance analysis focusing on two key metrics: Multimedia Metadata Extraction Accuracy and Semantic Enrichment Quality. Experimental results reveal significant accuracy improvements and higher enrichment quality when using semantically enriched ETL, underscoring the value of semantic technologies in optimizing data management and metadata processes for multimedia content.
Keywords
Extraction, loading, metadata, multimedia, semantic processing, transformation
Department
Applied Data Science
Recommended Citation
Shih Yu Chang, Sourab Rajendra Saklecha, and Yiyan Wu. "Intelligent Semantic Extraction and Transformation Pipeline for Large-Scale Multimedia Data Processing" IEEE International Symposium on Broadband Multimedia Systems and Broadcasting Bmsb (2025). https://doi.org/10.1109/BMSB65076.2025.11165552