Computer vision is guiding cranes

The FCAI success stories series explains why fundamental research in AI is needed and how research results create solutions to the needs of people, society and companies.

Researchers at FCAI are developing a positioning system based on machine learning and an increasingly accurate and inexpensive navigation solution.

At present, industrial cranes use expensive and massive systems for localization. Machines must know where to go at the construction site.

“They also need to be able to take a hold of objects with precision and lift them up safely. At the same time, the machines have to observe their surroundings to avoid accidents. This becomes more and more important when envisioning the future autonomous systems”, says Laura Ruotsalainen, Associate Professor of computer science.

Ruotsalainen and her group work on a project called Artificial Intelligence for Industrial Vision. They are developing a positioning system based on machine learning and an increasingly accurate and inexpensive navigation solution in cooperation with Konecranes, a world-leading group of lifting businesses.

The project focuses on developing a computer vision technique known as visual SLAM (simultaneous localisation and mapping) tolerating the feature-poor areas, varying lights and reflections.

With SLAM, a machine moving at a construction site will be constantly aware of its position with the help of cameras utilising this technology. The cameras will also generate a continuously up-to-date map of the surroundings.

“The hardest part is getting computer vision to recognise humans. This requires deep learning,” Ruotsalainen says. “For monitoring the presence of moving objects in the area, especially humans, multi-object tracking methods are required.”

3D multi-object tracking is able to track the motion of multiple objects and set their position into the three-dimensional coordinate frame. This technique requires developing sophisticated computer vision and deep learning techniques.

“The great thing is that in this study, we will have the chance to work in a research laboratory simulating a genuine factory environment. During the project, we will collect data and identify the biggest problems. At the same time we are developing the visual SLAM technique and deep learning to tackle these problems”, says Ruotsalainen.

The results of the project will enable the development of sustainable industrial premises that are more cost-efficient and safer for the employees.

The results may also benefit the development of self-driving cars in the future since the same questions related to positioning and observing the surroundings concern them as well.

Read more:
Com­puter vis­ion is already guid­ing cranes – Research aims for in­creas­ingly ac­cur­ate nav­ig­a­tion
Research group: Spatiotemporal data analysis