Patent classifications
G01S3/8006
Sound source localization using phase spectrum
An array of microphones placed on a mobile robot provides multiple channels of audio signals. A received set of audio signals is called an audio segment, which is divided into multiple frames. A phase analysis is performed on a frame of the signals from each pair of microphones. If both microphones are in an active state during the frame, a candidate angle is generated for each such pair of microphones. The result is a list of candidate angles for the frame. This list is processed to select a final candidate angle for the frame. The list of candidate angles is tracked over time to assist in the process of selecting the final candidate angle for an audio segment.
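The per-pair phase analysis described above can be sketched as follows. This is a minimal illustration, not the patented method: it assumes a linear array, a far-field source, a phase-only cross-spectrum (GCC-PHAT style) delay estimate, and a simple energy test for the "active" state. The names `ENERGY_THRESHOLD`, `phase_delay`, and `candidate_angles` are hypothetical, and the step that selects a final angle from the list is left out because the abstract does not specify it.

```python
import cmath
import math
from itertools import combinations

SPEED_OF_SOUND = 343.0   # m/s; assumed room-temperature value
ENERGY_THRESHOLD = 1e-4  # hypothetical "active microphone" threshold


def dft(x):
    """Naive O(n^2) discrete Fourier transform, adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def is_active(frame):
    """Treat a microphone as active when its mean frame energy is high enough."""
    return sum(s * s for s in frame) / len(frame) > ENERGY_THRESHOLD


def phase_delay(a, b, max_lag):
    """Lag in samples by which frame b trails frame a, using phase only."""
    n = len(a)
    phase = []
    for fa, fb in zip(dft(a), dft(b)):
        cross = fb * fa.conjugate()
        mag = abs(cross)
        phase.append(cross / mag if mag > 1e-12 else 0j)  # discard magnitude

    def score(lag):
        # Inverse transform of the phase spectrum evaluated at one lag.
        return sum(p * cmath.exp(2j * math.pi * k * lag / n)
                   for k, p in enumerate(phase)).real

    return max(range(-max_lag, max_lag + 1), key=score)


def candidate_angles(frames, mic_x, sample_rate):
    """Candidate angles in degrees from broadside, one per active pair.

    frames: one list of samples per microphone for the same frame.
    mic_x: microphone positions in metres along the array axis (distinct).
    """
    angles = []
    for i, j in combinations(range(len(frames)), 2):
        if not (is_active(frames[i]) and is_active(frames[j])):
            continue  # no candidate unless both microphones are active
        spacing = mic_x[j] - mic_x[i]
        max_lag = math.ceil(abs(spacing) * sample_rate / SPEED_OF_SOUND)
        lag = phase_delay(frames[i], frames[j], max_lag)
        ratio = SPEED_OF_SOUND * lag / (sample_rate * spacing)
        angles.append(math.degrees(math.asin(max(-1.0, min(1.0, ratio)))))
    return angles
```

Calling `candidate_angles(frames, [0.0, 0.1, 0.2], 16000)` on a three-microphone frame returns up to three angles, one per pair. A tracker could then compare that list with the lists from earlier frames before committing to a single direction.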
Simultaneous acoustic event detection across multiple assistant devices
Implementations can detect respective audio data that captures an acoustic event at multiple assistant devices in an ecosystem that includes a plurality of assistant devices, process the respective audio data locally at each of the multiple assistant devices to generate respective measures that are associated with the acoustic event using respective event detection models, process the respective measures to determine whether the detected acoustic event is an actual acoustic event, and cause an action associated with the actual acoustic event to be performed in response to determining that the detected acoustic event is the actual acoustic event. In some implementations, the multiple assistant devices that detected the respective audio data are anticipated to detect the respective audio data that captures the actual acoustic event based on a plurality of historical acoustic events being detected at each of the multiple assistant devices.
Method, apparatus and computer program product for determining the location of a plurality of speech sources
The present invention discloses a method, apparatus and computer program product for determining the location of a plurality of speech sources in an area of interest, comprising performing an algorithm on a signal issued by any one of said plurality of speech sources in the area to iteratively recover data characteristic of said signal, wherein the algorithm is an iterative model-based sparse recovery algorithm, and wherein for each of a plurality of points in said area, the iteratively recovered data is indicative of a presence of a plurality of speech sources contributing to the signal received at each of a plurality of points in the area.
METHODS CIRCUITS DEVICES SYSTEMS AND ASSOCIATED COMPUTER EXECUTABLE CODE FOR ACQUIRING ACOUSTIC SIGNALS
The present invention includes methods, circuits, devices, systems and associated computer executable code for acquiring, processing and rendering acoustic signals. According to some embodiments, one or more direction specific audio signals may be generated using a microphone array comprising two or more microphones and an audio stream generator. The audio stream generator may receive a direction parameter from an optical tracking system. There may be provided an audio rendering system adapted to normalize and/or balance acoustic signals acquired from a soundscape.
Performance of a time of flight (ToF) laser range finding system using acoustic-based direction of arrival (DoA)
An acoustic-based Direction of Arrival (DoA) system uses acoustic information to determine the direction of incoming sound, such as a person talking. The direction of the sound is then used to focus a laser-based time of flight (ToF) system to narrow the area of laser illumination, improving the signal to noise ratio because laser illumination is focused on the direction of the sound. The DoA system also provides elevation information pertaining to the source of the sound, to further narrow the required field of view of the laser ToF system.
Method and apparatus for associating audio objects with content and geo-location
An approach is provided for efficiently capturing, processing, presenting, and/or associating audio objects with content items and geo-locations. A processing platform may determine a viewpoint of a viewer of at least one content item associated with a geo-location. Further, the processing platform and/or a content provider may determine at least one audio object associated with the at least one content item, the geo-location, or a combination thereof. Furthermore, the processing platform may process the at least one audio object for rendering one or more elements of the at least one audio object based, at least in part, on the viewpoint.
SOUND SOURCE LOCALIZATION APPARATUS
A sound source localization apparatus is provided which can more reliably detect a sound source located in a detection target region. The sound source localization apparatus includes a plurality of microphones and a baffle. The baffle has a first surface and a second surface. The second surface is a surface opposite to the first surface. The plurality of microphones are two-dimensionally arrayed and fixed in the first surface. The baffle allows the plurality of microphones to pick up direct sound arriving at the first surface and prevents the plurality of microphones from picking up direct sound arriving at the second surface.
SOUND SOURCE IDENTIFICATION APPARATUS AND SOUND SOURCE IDENTIFICATION METHOD
A sound source identification apparatus includes a sound collection unit including a plurality of microphones, a sound source localization unit configured to localize a sound source on the basis of an acoustic signal collected by the sound collection unit, a sound source separation unit configured to perform separation of the sound source on the basis of the signal localized by the sound source localization unit, and a sound source identification unit configured to perform identification of a type of sound source on the basis of a result of the separation in the sound source separation unit, and a signal input to the sound source identification unit is a signal having a magnitude equal to or greater than a first threshold value which is a predetermined value.
ESTIMATION DEVICE, ESTIMATION METHOD, AND COMPUTER PROGRAM PRODUCT
According to one embodiment, an estimation device includes one or more hardware processors configured to function as: a conversion module configured to perform time frequency conversion on acoustic signals of a plurality of channels to acquire a frequency spectrum; a spatial correlation calculation module configured to calculate a spatial correlation matrix from the frequency spectrum; a spatial correlation filter module configured to calculate a spatial correlation filter from the spatial correlation matrix; and a direction estimation module configured to estimate general direction information from a partial element included in the spatial correlation filter.
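As a rough illustration of the spatial correlation step, the sketch below averages the outer product of the multichannel spectrum over frames for a single frequency bin. It assumes the time-frequency conversion has already been done, and it stops at the matrix: the abstract does not define how the spatial correlation filter or the direction estimate is derived from it. The function names are hypothetical.

```python
import cmath


def spatial_correlation(frames):
    """Average x x^H over frames for one frequency bin.

    frames: list of per-frame channel vectors of complex spectrum values.
    Returns a channels-by-channels Hermitian matrix as nested lists.
    """
    channels = len(frames[0])
    count = len(frames)
    return [[sum(x[i] * x[j].conjugate() for x in frames) / count
             for j in range(channels)]
            for i in range(channels)]


def inter_channel_phase(matrix, i, j):
    """Phase in radians of element (i, j): the mean phase lead of i over j."""
    return cmath.phase(matrix[i][j])
```

The off-diagonal phases carry the inter-channel delay information that a direction estimator would use, which is presumably why only part of the derived filter is needed to obtain a general direction.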