Acoustic Beamforming and Event Detection

Project description

Acoustic Beamforming

For humans, voice is the most natural way of communicating with each other. For this reason the desire of interacting with computers by voice was always present over the past 50 years. The performance capability of modern computers combined with recent advances in the development of artificial intelligence made it possible for devices to perceive voice commands. Cloud-based voice assistants, currently available on the market, are an emerging technology that enables the control of electrical household devices, gathering information from the Internet and setting up all kind of reminders. These interfaces utilize microphones that capture the spoken commands and transfer them into the cloud for further processing. As it can be seen in the picture on the next page, in a real environment there are many challenges to be overcome to receive a clean voice signal that can be interpreted by the device. For example, reflections (1), back ground noise (2), and acoustic echos (3,4) overlap with the desired voice signal and as a consequence degrade the word recognition rate.

Ph.D. Project Facts

ISP Research Team

Andreas Gaich
Eugen Pfann
Mario Huemer

Funding

Infineon Villach, opens an external URL in a new window

Partners

Infineon Villach, opens an external URL in a new window
Infineon Munich, opens an external URL in a new window

Duration

Nov. 2015 - Oct. 2019

FODOK

Therefore microphone arrays can be used to enhance the speech quality and suppress ambient noise. Microphone arrays utilize the spatial information of the sound sources and focus on the direction of the desired source while suppressing sources from other directions. This is called beamforming.

In this Ph.D. project we develop model based as well as deep learning based beamforming algorithms for voice assistance applications and investigate the influence of microphone and array imperfections, such as microphone self noise, complex frequency response mismatch between microphones in the array, and microphone displacement, on the performance of these algorithms.

Acoustic Event Detection

Sound capturing devices become increasingly ubiquitous in different kind of environments ranging from homes and offices to car interiors and on mobile phones. Combining ease of installation and decreasing costs the main purpose of these devices is typically the recording and processing of speech. However, at the same time these devices can be utilized to scan the environment in order to detect events which have a particular acoustic signature. For example, this could be a home appliance break down, a water leak, an intrusion scenario or a person or object falling to the ground.

Machine learning algorithms are applied in this project for acoustic event detection. A main focus lies on an increase of algorithm reliability, as depending on the type of event a false alarm can have similar negative consequences as a non-detected event.

Publications

Gaich A., Huemer M.: "Influence of MEMS Microphone Imperfections on the Performance of First-Order Adaptive Differential Microphone Arrays," in Computer Aided Systems Theory - EUROCAST 2017, Serie Lecture Notes in Computer Science, Vol. 10672, Springer International Publishing, Cham, Seite(n) 170-178, 2018

Name	Purpose	Lifetime	Provider
CookieConsent	This cookie saves your settings about cookie-handling at this website.	1 year	JKU
se_mode	This cookie is used for settings of the site search.	1 year	JKU

Name	Purpose	Lifetime	Provider
_gcl_au	This cookie is used by Google Analytics to understand user interaction with the website.	3 months	Google
_ga	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.	2 years	Google
_gid	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.	1 day	Google
_gat_UA-112203476-1	This is a pattern type cookie set by Google Analytics, where the pattern element on the name contains the unique identity number of the account or website it relates to. It appears to be a variation of the _gat cookie which is used to limit the amount of data recorded by Google on high traffic volume websites.	1 minute	Google
_pk_id	This cookie is used to store a few details about the user such as the unique visitor ID.	13 months	JKU
_pk_ses	This cookie is a short lived cookie used to temporarily store data for the visit.	30 minutes	JKU
_pk_ref	This cookie is used to store the attribution information, the referrer initially used to visit the website.	6 months	JKU

Name	Purpose	Lifetime	Provider
_gcl_au	This cookie is used by Google Analytics to understand user interaction with the website.	3 months	Google
_ga	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.	2 years	Google
_gid	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.	1 day	Google
_gac_UA-112203476-1	Contains campaign related information for the user and measures the AdWords campaign success.	90 days	Google
test_cookie	This cookie is set to determine if the website visitor's browser supports cookies. Doesn't contain personal identifier.	15 minutes	Google
IDE	This cookie carries out information about how the end user uses the website and any advertising that the end user may have seen before visiting the said website.	1 year	Google
_gcl_aw	This cookie is set when a user clicks an ad to reach our website. It informs about the success of campaigns and allows to connect ads to conversion targets.	3 months	Google
AMCV_xx	This is a pattern type cookie name associated with Adobe Marketing Cloud. It stores a unique visitor identifier, and uses an organisation identifier to allow a company to track users across their domains and services.	3 years	LinkedIn
bcookie	Contains a browser ID.	2 years	LinkedIn
bscookie	Contains a browser ID for a secure connection.	2 years	LinkedIn
lang	This cookie is used to store the language preference of our visitors	Session	LinkedIn
lidc	This cookie carries out information about how the end user uses the website and any advertising that the end user may have seen before visiting the said website.	1 day	LinkedIn
lissc	This cookie is used to analyze how a visitor interacts with embedded services.	1 year	LinkedIn
UserMatchHistory	This cookie is set when a user clicks an ad to reach our website. It informs about the success of campaigns and allows to connect ads to conversion targets.	30 days	LinkedIn
fr	This cookie is set when a user clicks an ad to reach our website. It informs about the success of campaigns and allows to connect ads to conversion targets.	90 days	Facebook
fbp	This cookie is used to display advertisings, for example third-party real time offers.	90 days	Facebook
sc_at	This cookie is used to identify a visitor across multiple domains.	1 year	Snap
sc-country	This cookie is used to determine a visitor's country.	1 day	Snap
uid	This cookie sets a random User-ID and helps at real time bidding for display advertising to targeted audiences.	60 days	Adform
C	This cookie identifies if user’s browser accepts cookies. 1 – Cookies are allowed, 3 – Opt-out.	30 days	Adform