Sony Hiring for Speech Recognition Intern Role – Apply Now!

Sony is hiring for a Speech Recognition Intern role. Interested candidates can go through the details below and apply using the link provided at the bottom of the post. Apply before the position is filled.

Sony – Speech Recognition Intern

Company name: SONY
Website: www.sony.com
Job Role: Speech Recognition Intern
Work Location: Bengaluru, Karnataka, India
Job Type: Internship
Experience: Freshers
Qualification: Currently pursuing or completed a Master’s (Research) or Ph.D. in deep learning/machine learning, with hands-on experience with Transformer models applied to audio/speech.
Batch: Not Mentioned
Package: As Per Company Standards

Job Description

Key Responsibilities

  • Research and design methods to improve the robustness of ASR systems in challenging scenarios such as noisy environments, low-resource languages, or domain shifts.
  • Study hallucination issues in end-to-end ASR models (e.g., Whisper, Wav2Vec2) and develop effective mitigation strategies.
  • Perform large-scale experiments on speech datasets and analyze ASR performance across diverse noise conditions and linguistic variations.
  • Contribute to research publications, technical documentation, or open-source tools based on the outcomes of the work.

Work Location

Remote

Internship Duration

  • Paid internship for 6 months, starting from the second week of August 2025.
  • Working hours: 9:00 AM to 6:00 PM, Monday to Friday.

Qualifications

  • Currently pursuing or completed a Master’s (Research) or Ph.D. in Deep Learning/Machine Learning.
  • Hands-on experience with Transformer models applied to audio or speech-related tasks.

Must-Have Skills

  • Proficiency in Python and experience with PyTorch or TensorFlow.
  • Familiarity with speech processing libraries like Torchaudio, ESPnet, or Hugging Face Transformers.
  • Prior exposure to ASR models such as Wav2Vec2, Whisper, or RNN-T.
  • Ability to read and implement research papers effectively.
  • Strong understanding of machine learning and signal processing fundamentals.

Good-to-Have Skills

  • Knowledge of prompt tuning, contrastive learning, or multi-modal architectures.
  • Experience in evaluating hallucinations or generating synthetic speech/audio perturbations.

How to Apply?

  • To apply for a job, read through all information provided on the job listing page carefully.
  • Look for the apply link, which is provided at the bottom of the post.
  • Clicking on the apply link will take you to the company’s application portal.
  • Enter your personal details and any other information requested by the company in the application portal.
  • Pay close attention to the instructions provided and fill out all necessary fields accurately and completely.
  • Double-check all the information provided before submitting the application.
  • Ensure that your contact information is correct and up to date, and that your application accurately reflects your qualifications and experience.
  • Submitting an application with incorrect or incomplete information could harm your chances of being selected for an interview.

About SONY

Sony is a renowned multinational conglomerate that has established itself as a prominent player in various industries. With its headquarters in Tokyo, Japan, Sony has made significant contributions to the fields of electronics, entertainment, gaming, and telecommunications. The company’s commitment to innovation and quality has resulted in the development of cutting-edge products that have captured the imagination of consumers worldwide. Sony’s diverse portfolio encompasses a wide range of consumer electronics, including televisions, cameras, audio equipment, and gaming consoles. Moreover, the company’s entertainment division has produced critically acclaimed movies, music, and television shows, showcasing its creative prowess. With a rich history and a strong global presence, Sony continues to shape the future of technology and entertainment, delivering experiences that enrich the lives of people everywhere.