Book chapter, 2025
Other projects
- (ROBOWELL) ROBOts as Welfare Technologies and Actors for ELderLy Care: A Nordic Model for Integration of Advanced Assistive Technologies
- AMBIENT – Bodily Entrainment to Audiovisual Rhythms
- fourMs Lab Upgrade
- Norwegian Centre for Embodied AI (NCEI)
- Predictive and Intuitive Robot Companion (PIRC)
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion
Latest results
Book chapter
-
-
Book chapter, 2025
PINE: Planning and Identifying Neural Network for Thinking Fast and Slow
-
Book chapter, 2025
An Autonomous Floor Clearing Strategy to Tidy up Unknown Home Environments with a Mobile Manipulator Robot
-
Book chapter, 2025
Heart Rate Forecasting Using Ultra-Wideband Radar with Sequence-to-Sequence Model
-
Book chapter, 2025
Situation-Based Navigation Strategy Switching for Mobile Robots in Dynamic Pedestrian Environments
-
Book chapter, 2025
Integrating Bilevel Planning and Offline Skill Learning for Enhancing Mobile Manipulation
Journal article
-
Journal article, 2026
Dual Process Dreamer: Fast and Slow Decision-Making with World Models
-
Journal article, 2026
Investigating Auditory–Visual Perception Using Multi-Modal Neural Networks with the SoundActions Dataset
Musicologists, psychologists, and computer scientists study relationships between auditory and visual stimuli from very different perspectives and using various terminologies and methodologies. This article aims to bridge the gap between phenomenological sound theory, auditory–visual theory, and audio–video processing and machine learning. We introduce the SoundActions dataset, a collection of 365 audio–video recordings of (primarily) short sound actions. Each recording has been human‑labeled and annotated according to Pierre Schaeffer’s theory of reduced listening, which describes the property of the sound itself (e.g., ‘an impulsive sound’) instead of the source (e.g., ‘a bird sound’). With these reduced‑type labels in the audio–video dataset, we conducted two experiments: (1) fine‑tuning the latest audio–video transformer model on the reduced‑type labels in the SoundActions dataset, proving that the model can recognize reduced‑type labels, and observing that the modality‑imbalance phenomenon is similar to the added value theory by Michel Chion and (2) proposing the Ensemble of Perception Mode Adapters method inspired by Pierre Schaeffer’s three listening modes, improving the audio–video model also on reduced‑type tasks.
-
Journal article, 2025
Privacy-Preserving 3D Lidar-Based Multi-Modal Activity Recognition in Human-Robot Interaction
-
Journal article, 2025
Robot Ethics: Ethical, Legal, and User Perspectives in the Development and Application of Robotics and Automation [From the Guest Editors]