Innovation 18/04/2024

CLS DataLab: When Data Springs into Action

The CLS DataLab is an internal, cross-functional innovation hub within our group, made up of data scientists, developers, and Big Data engineers.

Its primary mission is to develop large-scale data transformation software and generate knowledge from raw data. These algorithmic processes rely heavily on Artificial Intelligence, particularly through machine learning (ML) and deep learning (DL). The DataLab offers innovative solutions to tackle complex problems and often explores new techniques that push the boundaries of the current state of the art.

Bringing together a wide range of expertise under one roof, the DataLab specializes in data processing and machine learning: data visualization, supervised and unsupervised learning, deep learning, recurrent and feedforward models, and more.

 

Who’s behind the CLS DataLab?

Graduates from top engineering schools and holders of specialized master’s degrees in machine learning, the members of the CLS DataLab have hands-on professional experience in the fields of machine learning, deep learning, and reinforcement learning.

These skills are applied to topics such as automated processing of imagery and Earth observation data, classification and segmentation, as well as time series analysis for tracking data from beacons, land vehicles, and marine vessels.

In particular, the techniques implemented are designed to complement domain-specific expertise by allowing expert teams to focus on the most complex cases, while promoting the automation of data enhancement processes.

 

You often talk about training your AIs and working at large scale—can you tell us more?

On a daily basis, our DataLab helps make machine learning and artificial intelligence more accessible across the CLS Group by:

  • Conducting advanced studies applied to real-world use cases,
  • Developing scalable data processing and machine learning platforms that can be industrialized,
  • Supporting the group’s tech watch efforts on systematic data processing and AI.

We focus on making our models scalable, and our development processes are streamlined through a code template used across many activities. This approach makes our code more robust, efficient, sustainable, and reusable for various client use cases—something few companies can offer today.

 

How does CLS master these new technologies?

At CLS, excellence is our compass, and our teams of data science experts embody this constant pursuit of innovation. Our data enthusiasts have a firm grasp of key domains, positioning CLS at the forefront of satellite observation-based solutions. Our experts excel in image and signal processing to extract critical insights.

With a strong command of statistics, they use this discipline to shape data and reveal meaningful trends and key information. Our data scientists also harness the full potential of Deep Learning, allowing us to process the massive volume of data hosted in-house, within our own Data Centers, and deliver solutions that leverage the latest advances in the field.

 

In your opinion, what makes the CLS DataLab unique?

Without a doubt, our business-oriented approach. Our teams don’t just understand these technologies—they apply them directly to meet the operational needs of our clients.

At CLS, we believe that technology is not an end in itself, but a means to achieve broader goals. Our teams are structured to bring AI, Machine Learning, and Deep Learning to experts working in diverse fields such as environmental monitoring, sustainable fisheries management, maritime safety, energy, infrastructure, and mobility.

We have thus integrated data scientists within our organization, directly into our commercial departments, placing them as close as possible to our clients and business needs. Together, we build bridges between big data and field experts, enabling a deeper and more informed understanding of the activities we monitor.

 

I can give you a concrete example in hydrology, specifically in the reconstruction of bathymetric data (the topography of the bottom of water bodies). To this day, I’m extremely proud of this achievement because, to my knowledge, it is unique in the world.

In a context of increasing water stress, knowing the available water reservoirs is key to the sustainable management of this precious resource. In this case, the business need was to estimate the topography of the bottom of a water body in order to calculate its volume from its surface area. While satellite techniques for measuring surface areas are well established today, there was still no generalized method for estimating lakebed topography.
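To make the link between bathymetry, surface area, and volume concrete, here is a minimal sketch in Python. It is not CLS’s reconstruction method. It only illustrates why a known lakebed topography lets you turn an observed surface area into a volume. The gridded bed elevations, the cell size, the candidate water levels, and all function names are hypothetical and chosen for illustration.

```python
from typing import List, Sequence, Tuple

# Hypothetical toy bathymetry: lakebed elevation (m) per grid cell.
# The rim sits at 100 m; the deepest cell is at 95 m.
TOY_BED_M: List[List[float]] = [
    [100.0, 100.0, 100.0, 100.0],
    [100.0,  98.0,  97.0, 100.0],
    [100.0,  97.0,  95.0, 100.0],
    [100.0, 100.0, 100.0, 100.0],
]
CELL_AREA_M2 = 100.0  # assumed 10 m x 10 m cells


def area_and_volume(bed_m: Sequence[Sequence[float]],
                    water_level_m: float,
                    cell_area_m2: float) -> Tuple[float, float]:
    """Return (surface area in m^2, volume in m^3) at a given water level.

    A cell counts as wet when the water level is strictly above its bed.
    """
    area = 0.0
    volume = 0.0
    for row in bed_m:
        for bed in row:
            depth = water_level_m - bed
            if depth > 0:
                area += cell_area_m2
                volume += depth * cell_area_m2
    return area, volume


def volume_from_area(bed_m: Sequence[Sequence[float]],
                     observed_area_m2: float,
                     cell_area_m2: float,
                     candidate_levels_m: Sequence[float]) -> Tuple[float, float]:
    """Return (level, volume) for the candidate level whose flooded area
    is closest to the observed surface area (e.g. measured by satellite)."""
    best_level = min(
        candidate_levels_m,
        key=lambda lvl: abs(area_and_volume(bed_m, lvl, cell_area_m2)[0]
                            - observed_area_m2),
    )
    return best_level, area_and_volume(bed_m, best_level, cell_area_m2)[1]
```

For example, `volume_from_area(TOY_BED_M, 300.0, CELL_AREA_M2, [95.0, 96.0, 97.0, 98.0, 99.0, 100.0])` selects the candidate level whose flooded area best matches the 300 m² observation and returns the corresponding volume. A real workflow would use a finer level step and a properly georeferenced elevation grid.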

Thanks to our data scientists, specifically Jérémy Augot on this project, working closely with our hydrology engineers, we were able to develop an AI-based solution capable of reconstructing the bathymetry of a given water body from the surrounding topographic data.

In the image on the left, we can see the topography surrounding Lake Gimone in the southwest of France, as well as the surface of the studied lake outlined in red.
In the image on the right, we see the reconstruction of the lake’s bathymetry generated by CLS’s AI.

 

CLS is known for developing solutions that support the sustainability of our planet. How is expertise in data science a key asset in this context?

At our company, innovation is the driving force behind our services. We believe in the power of data to understand, protect, and sustainably manage our planet. Our data science experts are the architects of this vision, applying their know-how to create meaningful impact in critical fields.

We are shaping the landscape of satellite observation with committed passion—one algorithm at a time.

 

CLS insights are known for their relevance. What’s your secret?

The cornerstone of our relevance lies in massive, consistent, and diverse data. Since 1986, CLS has established itself as a pioneer in Earth data collection, becoming the guardian of our planet’s health records. Our data centers hold an invaluable treasure: over 3,000 terabytes of data waiting to be leveraged by our data scientists.

CLS has invested in the future with the creation of a Data Lake. More than just a data reservoir, it’s a sophisticated ecosystem where vast amounts of pre-formatted, ready-to-use data are waiting to be explored. This Data Lake is the playground of our data scientists, a space where innovation comes to life in response to our clients’ needs. Additionally, with controlled access to over 400 satellites, we stand out thanks to the richness and diversity of our data sources. Our data science services benefit from exceptional connectivity to space, enabling global and detailed coverage of our planet.
