Jim Tørresen

Informatics

Other projects

Latest results

Book chapter

Book chapter, 2025

AI-Based User Gesture Recognition for Human-Robot Interaction Using Wrist Sensors

Geir Paulsen ; Juan Sebastian Cardenas ; Rosa Nicoline Pham Alsgaard ; Adel Baselizadeh ; Md Zia Uddin ; Jim Tørresen

Book chapter, 2025

PINE: Planning and Identifying Neural Network for Thinking Fast and Slow

Håkon Haustreis Tønnessen ; Md Zia Uddin ; Jim Tørresen

Book chapter, 2025

An Autonomous Floor Clearing Strategy to Tidy up Unknown Home Environments with a Mobile Manipulator Robot

Letícia dos Santos ; Jim Tørresen ; Mariana Kolberg ; Renan Maffei

Book chapter, 2025

Heart Rate Forecasting Using Ultra-Wideband Radar with Sequence-to-Sequence Model

Hoang Minh Pham ; Farzan Majeed Noori ; Md Zia Uddin ; Jim Tørresen

Book chapter, 2025

Situation-Based Navigation Strategy Switching for Mobile Robots in Dynamic Pedestrian Environments

Shunsuke Goka ; Ørjan Strand ; Jun Miura ; Jim Tørresen

Book chapter, 2025

Integrating Bilevel Planning and Offline Skill Learning for Enhancing Mobile Manipulation

Shin Watanabe ; Geir Horn ; Jim Tørresen ; Kai Olav Ellefsen

Journal article

Journal article, 2026

Dual Process Dreamer: Fast and Slow Decision-Making with World Models

Tobias Lømo ; Adel Baselizadeh ; Kai Olav Ellefsen ; Jim Tørresen

Journal article, 2026

Investigating Auditory–Visual Perception Using Multi-Modal Neural Networks with the SoundActions Dataset

Jinyue Guo ; Jim Tørresen ; Alexander Refsum Jensenius

Musicologists, psychologists, and computer scientists study relationships between auditory and visual stimuli from very different perspectives and using various terminologies and methodologies. This article aims to bridge the gap between phenomenological sound theory, auditory–visual theory, and audio–video processing and machine learning. We introduce the SoundActions dataset, a collection of 365 audio–video recordings of (primarily) short sound actions. Each recording has been human‑labeled and annotated according to Pierre Schaeffer’s theory of reduced listening, which describes the property of the sound itself (e.g., ‘an impulsive sound’) instead of the source (e.g., ‘a bird sound’). With these reduced‑type labels in the audio–video dataset, we conducted two experiments: (1) fine‑tuning the latest audio–video transformer model on the reduced‑type labels in the SoundActions dataset, proving that the model can recognize reduced‑type labels, and observing that the modality‑imbalance phenomenon is similar to the added value theory by Michel Chion; and (2) proposing the Ensemble of Perception Mode Adapters method inspired by Pierre Schaeffer’s three listening modes, improving the audio–video model also on reduced‑type tasks.

Journal article, 2025

Privacy-Preserving 3D Lidar-Based Multi-Modal Activity Recognition in Human-Robot Interaction

Adel Baselizadeh ; Md Zia Uddin ; Weria Khaksar ; Diana Saplacan Lindblom ; Jim Tørresen

Journal article, 2025

Robot Ethics: Ethical, Legal, and User Perspectives in the Development and Application of Robotics and Automation [From the Guest Editors]

Jim Torresen ; Cecilia Laschi ; Edson Prestes ; Lydia E. Kavraki ; Praminda Caleb-Solly ; Yueh-Hsuan Weng


MishMash Centre for AI and Creativity

A Norwegian Research Consortium
