Paper: | MSP-P1.3 | ||
Session: | Human Machine Interface; Signal Processing for Media Integration and Application | ||
Time: | Friday, May 21, 09:30 - 11:30 | ||
Presentation: | Poster | ||
Topic: | Multimedia Signal Processing: Human-Machine Interface | ||
Title: | MULTIPLE PERSON AND SPEAKER ACTIVITY TRACKING WITH A PARTICLE FILTER | ||
Authors: | Neal Checka; Massachusetts Institute of Technology | ||
Kevin Wilson; Massachusetts Institute of Technology | |||
Michael Siracusa; Massachusetts Institute of Technology | |||
Trevor Darrell; Massachusetts Institute of Technology | |||
Abstract: | In this paper, we present a probabilistic tracking framework thatcombines sound and vision to track multiple people. In a cluttered or noisy scene multi-person tracking estimates have a distinctly non-Gaussian distribution. We apply a particle filter with audio and video state components, and derive observation likelihoods based on both audio and video measurements. Our state includes the number of people present, their positions, and whether each person is talking. We show experiments in an environment with sparse microphones and monocular cameras. Our results show that our system can accurately track person locations and speaker activity. | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops