Latest articles

Beyond Static Chunking: Multimodal and Adaptive Segmentation for Improved Information Retrieval
Introduction: Retrieval-Augmented Generation (RAG) systems [Li et al., 2025] rely heavily on how documents are segmented before indexing. This preprocessing step, known as chunking, directly affects retrieval quality and, consequently, the performance of downstream question-answering systems. In practice, chunking strategies [Jain et al., 2025] rarely take document structure into account; instead, documents are usually split into fixed-size segments without considering their semantic content or modality. Some more advanced

DevOps & Middleware: a new project to facilitate the management of our deployments and enhance their reliability
A new collaboration project has been initiated with the Spirals team at the Inria Lille laboratory. In this project, we want to address the steering, orchestration, and maintenance mechanisms for our applications’ packaging, delivery, and deployment activities. The lack of formalization and traceability of the actions carried out for the deployment of our applications,

The Helios Project
A few weeks ago, the DRIT began working on a particularly innovative project that should be useful to us all: the Helios project. The ambition is to build an automatic observatory that collects, sorts, and classifies information from the net. The system is intended to be flexible and should enable us to keep a watch on

Knowledge at your fingertips: Building an Ontological Knowledge Base from our Editorial database
Our times are increasingly influenced by the prevalence of large volumes of data. These data most often hide great human intelligence. This intrinsic knowledge, whatever the field, would allow our information systems to be much more efficient in the processing and interpretation of structured and unstructured data. For example, the process of finding relevant documents

Automating the Detection of Duplicates in our Databases
Today’s complex applications for knowledge extraction and data mining use heterogeneous and distributed data. In this context, the quality of any decision depends on the quality of the data used. Indeed, without accurate and reliable data, bad decisions can be made. In order to provide a better understanding of the source

Designing the right way to express constraints in a Java Architecture for Optimisation problems
In real life, we often have to solve problems with constraints, such as shopping in several distant stores, planning and organizing vacation expenses, or packing things in boxes during a move. Solving these constrained problems is not always easy, particularly when it comes to optimizing a set of criteria such as time,

Testing the migration of Graphical User Interfaces
Technologies and frameworks are not immortal; it’s a fact. For a software vendor like us, migrating from one framework to another is regularly a necessity. When you consider large software like e-sedit, for example, migrating the entire software in one shot is almost impossible. Many companies are migrating their software systems. And so, we

The Traceability chronicles – Episode 1: How logs can help you understand what your users do?
This article begins a series explaining some of our most interesting results on the traceability of applications. We will address automatic code instrumentation, which lets the system trace itself, and the underlying difficulties. We will also discuss what constitutes a good, useful, and usable application trace. We will also

Citizen & Digital society
After signing the “RESET manifesto” published in Le Monde, Berger-Levrault is involved in a research collaboration with the association for the Foundation of a New Internet Generation (Fing), a reference think & do tank on digital transformations. This collaboration takes the form of two projects focusing on the theme “citizens and the digital society” from

When magic happens! Making Java and Visual Basic 6 dance together.
Visual Basic is a third-generation, event-driven programming language from Microsoft. It is based on the Component Object Model (COM) programming model and designed for beginners. The last version is version 6 (1998), and the core of FirmaDoc, our Spanish product dedicated to document and file management for public administration, is written in VB6. However,

A Mobile Social Robot as a tool for Nursing Homes
Over the centuries, scientific advances in medicine, pharmacology, and surgery have allowed the population to stay healthy for longer as it ages. Life expectancy in France has thus reached 85.6 years for women and 79.7 years for men (Papon et al., 2020). This increase in life expectancy suggests an increase in the aging population

Franco-German Dialogue Among AI Industry Leaders (Paris – France)


PMS 2026 – 34th International Conference on Program Comprehension (Toulouse – France)

ICPC – 34th International Conference on Program Comprehension (Rio de Janeiro – Brazil)

CHI 2026 – International conference on Human-Computer Interaction (Barcelona – Spain)

A New Milestone in Building a European AI Ecosystem for Industry
Last Friday marked a significant step forward for the European ecosystem of artificial intelligence applied to industry. After more than a year of close collaboration, the report resulting from the Franco-German dialogue on industrial AI was officially presented to French and German authorities during a dedicated forum held at the Ministry of Economy. This initiative,
Yearbook Research & Innovation 2025: Governed, Responsible, and reality-based Research!
We are proud to announce the publication of our 2025 Research & Innovation Yearbook! More than just an annual review, this document marks a structural shift in the way we design and conduct research. Innovation is no longer seen as a simple technological promise, but as a demanding discipline, confronted with the constraints of reality
Camille Dupré Ph.D. thesis defense: “Pad-based Interaction in Mixed Reality environments”
On Thursday 18th December at 2 p.m. Paris time, Camille Dupré, Ph.D. candidate, defended her thesis, “Pad-based Interaction in Mixed Reality environments”. Her thesis defense took place at the LISN, in Gif-sur-Yvette (660 Av. des Sciences Bâtiment, 91190), France. Take a look at the summary below. Summary Mixed Reality (MR) environments integrate virtual elements into
Nihed Bendahman Ph.D. thesis defense: “Evaluation and mitigation of hallucinations in automatic summarization in the specific context of legal documents”
On Monday 15th December at 2 p.m. Paris time, Nihed Bendahman, Ph.D. candidate, defended her thesis, “Evaluation and mitigation of hallucinations in automatic summarization in the specific context of legal documents”. Her thesis defense took place at the IRIT Research Laboratory, in Toulouse, France. Take a look at the summary below. Summary Legal monitoring is a
Gabriel Darbord Ph.D. thesis defense: “Automatic test generation to help modernize our applications”
On Friday 5th December at 9 a.m. Paris time, Gabriel Darbord, Ph.D. candidate, defended his thesis, “Automatic test generation to help modernize our applications”. His thesis defense took place in Lille, France. Take a look at the summary below. This thesis is fully in line with the partnership between Berger-Levrault and Inria, which aims to
Berger-Levrault strengthens its ties with AI startups!
We are proud to announce that we have joined Hub France IA, the largest association dedicated to artificial intelligence in France. This network now brings together more than 250 members—companies, startups, research laboratories, and institutions—who share the same goal: to accelerate the development and adoption of AI in France and Europe. Getting closer to the
Celebrating New PhDs from the BL.Research Team!
At Berger-Levrault, research is more than a mission—it’s a shared adventure. As the new academic year begins, we are proud to celebrate the success of four of our colleagues from the BL.Research team, who have reached a major milestone in their scientific journeys: the defense of their doctoral theses. These achievements are the result of
Hamza Safri Ph.D. thesis defense: “Federated learning for the IoT: Application for Industry 4.0”
On Tuesday 24th June at 3 p.m. Paris time, Hamza Safri, Ph.D. candidate, defended his thesis, “Federated learning for the IoT: Application for Industry 4.0”. His thesis defense took place at Inria Minatec Grenoble, in Grenoble, France. Take a look at the summary below. Keywords: Model generalization, predictive maintenance, industrial IoT, federated learning, edge network,