Machine Learning for Audio Analytics Intern

  • 419 Phillip St, Waterloo, ON N2L 3X2, Canada
  • Employees can work remotely
  • Intern
  • Legal Entity: ROBERT BOSCH INC.

Company Description

The Bosch Research and Technology Center North America, with offices in Sunnyvale, California; Pittsburgh, Pennsylvania; and Cambridge, Massachusetts, is part of the global Bosch Group, a company with over 70 billion euros in revenue, 400,000 people worldwide, a very diverse product portfolio, and a history of over 125 years. The Research and Technology Center North America (RTC-NA) is committed to providing technologies and system solutions for various Bosch business fields, primarily in the areas of Human-Machine Intelligence (HMI), Robotics, Energy Technologies, Internet Technologies, Circuit Design, Semiconductors and Wireless, and MEMS Advanced Design.

Our global research on Human Machine Collaboration focuses on areas including Big Data Visual Analytics, Explainable AI, Audio Analytics, NLP, Conversational AI, Mixed Reality, and Smart Wearables. We develop intuitive, interactive, and intelligent solutions to enable inspiring UX for Bosch products and services in application areas such as autonomous driving, car infotainment and driver assistance systems (ADAS), Industry 4.0 and Internet of Things (IoT), security systems, smart home and building solutions, health care, and robotics.

As part of the global Human Machine Interaction research unit, our Visual Analytics & eXplainable AI group is responsible for shaping the future industrial AI experience for Bosch products and services by combining cutting-edge technologies in machine learning, data analysis, and interactive visualization. We research and develop scalable, transparent, and intelligent big data analytics solutions (e.g. for audio, images, and sensor logs) for various domains including Industry 4.0 (I4.0), IoT, autonomous driving, and connected vehicles. Our team includes award-winning researchers (IEEE VIS best paper and best paper runners-up), and we actively collaborate with leading groups in academia and industry to promote research ideas and publish research findings in internationally renowned conferences and journals, e.g., IEEE VIS, TVCG, SIGKDD, NeurIPS, AAAI, ICML, ICASSP, Interspeech, and IEEE Signal Processing Magazine.

Job Description

  • Develop state-of-the-art machine-learning-based audio signal processing and analysis technologies and modules
  • Collaborate with colleagues to integrate audio signal processing and analysis modules into prototype systems for Bosch Audio Event Detection applications and to improve their performance
  • Summarize research findings in high-quality paper and/or patent submissions


Basic Qualifications:

The ideal candidate is familiar with the following skill sets:

  • Strong background in audio signal processing, machine learning and related fields
  • Hands-on experience with audio event detection or audio scene classification technologies
  • Hands-on experience with deep learning technologies and familiarity with state-of-the-art deep learning toolkits
  • Good communication and team-working skills
  • Proficient with C/C++ and/or Python programming
  • Proficient with MATLAB

Preferred Qualifications:

  • Publication record in top machine learning and audio analytics venues (e.g. ICASSP, Interspeech, AAAI, ICML)
  • Experience with noise cancellation or source separation
  • Experience with audio signal localization

Additional Information

  • Major(s): Computer Science, Electrical Engineering or related fields
  • Degree level: Ph.D. – Must be a current student or recent graduate (less than 1 year)
  • Start Date: Summer 2021
  • Duration: 3 – 6 months
