This project is maintained by Embodied-Space
Audio Localization Demo from quin kennedy on Vimeo.
The Azure Kinect is a very capable little device. I have lots of experience using previous Kinect devices for scene segmentation, gestureal interfaces, skeleton tracking, point cloud rendering and so much more. A lot of those features are quicker, higher resolution, more robust, and in a smaller package for this iteration.
Because the prompt specifically asked me to explore a new feature, I wanted to dig into parts of the Kinect that are vastly different from previous generations, or that I don’t have as much experience with. The circular microphone array on the Azure Kinect immediately caught my interest.
I have put together a technical prototype which listens for claps and snaps, and calculates the direction from which the sound came. Even though solutions for this exist which are much more robust than what I could build in a couple of days, I wanted to prove out that the Kinect microphone array was accessible enough and high-quality enough to be able to support this use-case without getting into the weeds on implementing or compiling research-grade projects.
This shows a clap coming almost directly from the right side of the kinect
This prototype uses Python to analyze the microphone channels and calculate the directionality of sound sources. That direction is published over Spacebrew to allow for flexibility in the visualization side. I put together a simple web visualizer (shown above) to display the direction relative to the Azure Kinect in both plan and azimuth view.
See the Github Wiki for details on setting up the prototype.
I am proposing an interactive installation where the primary interaction is clapping. This clapping gets localized by the Azure Kinect microphone array and triggers a number of effects in the space.
As people add to the ‘track’, older samples will be dropped. If nobody is interacting with the track, previous sequences can fade in and out as the installation “dreams.”

This installation view shows an arrangement with: