data segmentation

Beyond Static Chunking: Multimodal and Adaptive Segmentation for Improved Information Retrieval

Introduction Retrieval-Augmented Generation systems [Li et al., 2025], called RAG, rely heavily on how documents are segmented before indexing. This preprocessing step, known as chunking, directly affects retrieval quality and, consequently, the performance of downstream question-answering systems. In practice, chunking strategies [Jain et al., 2025] hardly take into account the document structure; instead, documents are […]

Beyond Static Chunking: Multimodal and Adaptive Segmentation for Improved Information Retrieval Read More »