Patent classifications
G01S3/8006
METHOD AND APPARATUS FOR RECOGNIZING VOICE
A method and an apparatus for recognizing a voice are provided. The method may include: inputting a target voice into a pre-trained voice recognition model to obtain an initial text output by at least one recognition network in the voice recognition model, the recognition network including a plurality of preset types of processing layers, and at least one type of processing layer of the recognition network being obtained by training based on a voice sample in a preset direction interval; and determining a voice recognition result of the target voice, based on the initial text.
DETECTION DEVICE AND METHOD FOR AUDIO DIRECTION ORIENTATION AND AUDIO PROCESSING SYSTEM
A detection device and a method for audio direction orientation and an audio processing system are provided. The device includes a first filter, which performs a first infinite impulse response operation on each first audio beam to generate second audio beams; an absolute value operator which performs an absolute value operation on amplitude of each second audio beam to generate third audio beams; a second filter which performs a second infinite impulse response operation on each third audio beam to smooth each third audio beam to generate fourth audio beams; and a DOA processor which divides the fourth audio beams into audio beam groups, and selects a selected audio beam from each audio beam group according to energy of each fourth audio beam in each audio beam group to output beam information corresponding to the selected audio beams and used in a speech recognition and for determining a voice direction.
Device and method for sound localization
Disclosed is a device for sound localization. The device can determine a direction of sound adequately, and includes a spatial feature generator, a voice detector, an angle selector, and an angle retriever. The spatial feature generator generates M spatial feature signals according to signals of N microphones of a microphone array. The voice detector generates at least one voice detection signal according to at least one of the signals of the N microphones. The angle selector outputs a candidate angle signal according to the M spatial feature signals to indicate a candidate direction of sound. The angle retriever generates a sound detection result according to the M spatial feature signals to indicate whether any sound source exists, and then outputs an estimated angle signal indicative of a direction of sound according to the sound detection result, the at least one voice detection signal, and the candidate angle signal.
Method for voice recording and electronic device thereof
A method for voice recording and an electronic device thereof are provided. The method includes determining a voice recording mode from among a plurality of voice recording modes, determining a voice beamforming direction according to the determined voice recording mode, and recording voice signals based on the determined voice beamforming direction.
AUDIO RECOGNITION METHOD, METHOD, APPARATUS FOR POSITIONING TARGET AUDIO, AND DEVICE
Embodiments of this application disclose method and apparatus for positioning a target audio signal by an audio interaction device, and an audio interaction device The method includes: obtaining audio signals in a plurality of directions in a space, and performing echo cancellation on the audio signal, the audio signal including a target-audio direct signal; obtaining weights of a plurality of time-frequency points in the audio signals, a weight of each time-frequency point indicating, at the time-frequency point, a relative proportion of the target-audio direct signal in the audio signals; weighting time-frequency components of the audio signal at the plurality of time-frequency points separately for each of the plurality of directions by using the weights of the plurality of time-frequency points, to obtain a weighted audio signal energy distribution; and obtaining a sound source azimuth corresponding to the target-audio direct signal in the audio signals accordingly.
CONTROLLING A DEVICE BY TRACKING MOVEMENT OF HAND USING ACOUSTIC SIGNALS
A method, device and computer program product for controlling the device by tracking a movement of a hand or other objects. The device receives acoustic signals. At least a portion of the received signals are transformed into two-dimensional sinusoids whose frequencies are proportional to an angle-of-arrival (AoA) and a propagation distance of the reflected signals. An AoA-di stance profile is derived based on signals received from the object by evaluating frequencies of the two-dimensional sinusoids. Then, an AoA-di stance pair is derived from the AoA-di stance profile. A current location of the object is determined based on the estimated AoA-di stance pair. The device then performs a command in response to detecting that the user moved to perform the command based on prior and current locations of the object.
WIRELESS COMMUNICATION SYSTEM AND WIRELESS COMMUNICATION APPARATUS
A wireless communication system includes a control station and a plurality of second wireless communication apparatuses. The control station includes a management table that holds sending permission information indicating whether or not to transmit a packet for requesting to send an orientation estimation auxiliary signal. The plurality of second wireless communication apparatuses each refer to the management table, transmit the packet to a first wireless communication apparatus in response to the sending permission information, and perform orientation estimation using the orientation estimation auxiliary signal transmitted from the first wireless communication apparatus.
SIGNAL PROCESSING APPARATUS AND METHOD, AND PROGRAM
The present technology relates to a signal processing apparatus, a signal processing method, and a program capable of improving determination accuracy of a direct sound direction.
The signal processing apparatus includes a direction estimation unit that detects a sound section from a sound signal, and estimates a coming direction of a sound contained in the sound section, and a determination unit that determines which of sounds in a plurality of the coming directions is a sound arriving earlier in a case where the plurality of coming directions is obtained for the sound section by the estimation. The present technology is applicable to a signal processing apparatus.
BEAMFORMER ENHANCED DIRECTION OF ARRIVAL ESTIMATION IN A REVERBERANT ENVIRONMENT WITH DIRECTIONAL NOISE
An estimator of direction of arrival (DOA) of speech from a far-field talker to a device in the presence of room reverberation and directional noise includes audio inputs received from multiple microphones and one or more beamformer outputs generated by processing the microphone inputs. A first DOA estimate is obtained by performing generalized cross-correlation between two or more of the microphone inputs. A second DOA estimate is obtained by performing generalized cross-correlation between one of the one or more beamformer outputs and one or more of: the microphone inputs and other of the one or more beamformer outputs. A selector selects the first or second DOA estimate based on an SNR estimate at the microphone inputs and a noise reduction amount estimate at the beamformer outputs. The SNR and noise reduction estimates may be obtained based on the detection of a keyword spoken by a desired talker.
Method of providing service based on location of sound source and speech recognition device therefor
A speech recognition device is provided. The speech recognition device includes at least one microphone configured to receive a sound signal from a first sound source, and at least one processor configured to determine a direction of the first sound source based on the sound signal, determine whether the direction of the first sound source is in a registered direction, and based on whether the direction of the first sound source is in the registered direction, recognize a speech from the sound signal regardless of whether the sound signal comprises a wake-up keyword.