We’re building the future of human connection and the technologies that make it possible.

As a creative entertainment company, Sony seeks to utilize AI to unleash the potential of human creativity. For artists and creators, we aim to elevate their creativity to the next level. For those who enjoy entertainment, we aim to transform their lifestyles. Through this, we aspire to revolutionize culture and bring inspiration to lives around the world.
Founded 40 years ago on the simple idea of creating innovative products that change the world, Adobe offers groundbreaking technology that empowers everyone, everywhere to imagine, create, and bring any digital experience to life. At the heart of this innovation is Adobe Research, which is leading the way in audio AI technology for human-centered creativity. Our goal is to empower people to bring their creative ideas to life through high quality audio and video content. We achieve this through our research in the analysis, processing, and generation of speech, music, everyday sounds, and more. Our research also explores the intersection of audio with video, augmented reality, and natural language processing. To advance all of these research areas, we develop new machine learning models, novel signal processing algorithms, and new human computer interaction paradigms.
Mitsubishi Electric Research Labs (MERL) conducts both fundamental and applied research in artificial intelligence, signal processing, control, optimization and multi-physics modeling.
We are an open lab, publishing our results, collaborating with the world-wide research community, and measuring our performance by the impact we have on Mitsubishi Electric and the world. MERL researchers have the freedom to explore their scientific passions more fully. With an emphasis on the future, both the scientists and the management team take a long-term view of the research.
DataForce delivers high-quality, multimodal training data and services to power the next generation of AI. From large language models to voice, image, and video generation, DataForce supports AI innovators in tech, life sciences, automotive, and beyond with scalable, secure solutions for development, testing, and safety. Backed by cutting-edge technology and over one million data contributors, DataForce helps ensure AI systems are accurate, adaptable, and ready for real-world deployment. DataForce is part of TransPerfect, the world’s largest provider of language and AI solutions for global business, with offices in more than 140 cities worldwide. Learn more at www.dataforce.ai.
Dataocean AI is a leading global provider of data collection and annotation services, combining advanced technology with a diverse network of millions of data contributors, scientists, and engineers to deliver cutting-edge data for AI development. The company offers comprehensive data solutions across text, audio, image, and multimodal domains, supporting foundation models and generative AI applications. With over 1,790 off-the-shelf datasets and a proven track record of delivering thousands of customized data projects, Dataocean AI has earned the trust of more than 1,100 leading AI enterprises and institutions worldwide, including Meta, Microsoft, Amazon, IBM, Intel, BMW, Toshiba, Sony, Hyundai, Samsung, NUS, and Johns Hopkins University. Its services cover more than 247 languages and dialects worldwide, and the proprietary data platform ensures precise, efficient, and scalable execution across data collection, cleaning, labeling, and evaluation. With two decades of industry experience, Dataocean AI has established itself as a reliable and strategic partner in the global AI ecosystem, consistently delivering high-quality solutions and earning international recognition.
Google’s mission is to organize the world’s information and make it universally accessible and useful, and we advance that mission every day in incredible new ways. Research across Google provides new ways of looking at old problems and helps transform how we all work and live, and we think the biggest impact comes when everyone in the world can access it. To that end, we use state-of-the-art computer science techniques to solve problems for our users, our customers and the world, making it easier for you to do things every day, whether it’s searching for photos of people you love, breaking down language barriers, or helping you get things done with your own personal digital assistant.
Shanda AI Research Tokyo is the core research institute of Shanda Group Corp., dedicated to creating the next generation of intelligent game and entertainment applications. Founded in 2025 in Tokyo, the lab focuses on interactive and spatial intelligence, combining multimodal perception, spatiotemporal long-term memory, and world-model reasoning to enable personality-consistent, self-evolving digital humans. Our research spans speech and motion synthesis, facial and body animation, multimodal reasoning, self-evolving characters, diffusion-based rendering, and world models with spatiotemporal memory, aiming to merge scientific rigor with artistic creativity. Led by researchers from the University of Tokyo and the Institute of Science Tokyo, the team collaborates with global partners across Japan, the U.S., and Singapore to build an ecosystem where artificial intelligence becomes a new form of life: learning, feeling, and evolving alongside humans.
Treble developed a cloud-based acoustic simulation engine that combines massively accelerated wave-based finite element modeling with phased geometrical acoustics to deliver high-accuracy, full-bandwidth simulations at scale. This technology enables the generation of high-quality synthetic audio data and virtual prototyping workflows that advance research in acoustics, audio algorithms, and audio technology.
Analog Devices, Inc. (NASDAQ: ADI) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, AI, and software technologies into solutions that combat climate change, reliably connect humans and the world, and help drive advancements in automation and robotics, mobility, healthcare, energy and data centers. With revenue of more than $11 billion in FY25, ADI ensures today’s innovators stay Ahead of What’s Possible.
“Deepshare” provides online education and research tutoring services focused on artificial intelligence. Since its establishment in 2017, it has hired more than 500 outstanding AI researchers and provided more than 150,000 members with lectures and services covering AI theory, algorithms, programming, competitions, and scientific research. The instructors and students come from all over the world, and most of them study or do research at universities.
Helsing is an AI and software company in the defence sector, focused on critical AI capabilities for the protection of our democracies. It was founded as a technology company with the sole business purpose of developing and introducing AI capabilities in the security sector. Helsing’s mission is to empower democratic societies as a European technology pioneer, enabling them to make sovereign decisions and enforce their own ethical standards. Our teams develop technologies that bring the operational capabilities of new and existing defence systems into a new era. We design and develop new types of autonomous systems and work with governments and industry to integrate existing hardware into a new, AI-enabled network. Our mission: ‘Artificial intelligence for the protection of our democracies’.
Josh Talks is a speech research lab that designs and produces pipeline-ready conversational audio datasets across Indian languages. Our data captures real-world accents, dialects, code-mixing, and noisy environments at scale – with channel-separated audio, per-speaker metadata, and multi-tier annotation. Built for teams training and benchmarking ASR and speech-to-speech models with zero preprocessing overhead.
Tobii is the global leader in eye tracking and a pioneer in attention computing. For more than 20 years, we have developed technology that helps devices and machines understand human attention and intent. Our solutions are used in areas such as scientific research, gaming, extended reality, assistive technology, and automotive interior sensing. Tobii is headquartered in Stockholm, Sweden, operates globally, and is listed on Nasdaq Stockholm (TOBII).
VinUniversity is a bold new force in global higher education—founded by Vingroup and shaped through strategic alliances with Cornell and UPenn. In just five years, VinUni earned QS 5 Stars and full FIBAA accreditation, redefining excellence while cultivating visionary leaders ready to drive global impact.
ai-coustics builds the audio intelligence layer for Voice AI. Our SDK conditions raw audio into stable, machine-ready input through real-time noise handling, speaker isolation, and voice activity detection. By making audio reliable under real conditions, we enable voice systems that work consistently the moment they leave the lab.
Artificial Intelligence Security Research Center (AISRC) is dedicated to advancing AI security through cutting-edge research and innovative solutions. Our expertise spans Deepfake Voice Detection, Spoofing-Aware Speaker Verification, Speech Anonymization, Audio Watermarking and Proactive Defense against Voice Deepfakes. Our mission is to develop pioneering research and practical applications that safeguard users against fraud and misinformation. Based in Seoul, South Korea, AISRC collaborates with industry partners and academic institutions to develop robust solutions that protect individuals and organizations from emerging digital threats.
Besimple AI is building the data layer for AI, starting with audio. We provide licensed audio data, expert annotation, and voice actor support for teams developing speech, voice, and multimodal models. Our proprietary datasets span conversational and other audio formats across diverse languages, dialects, accents, and real-world scenarios. Combined with expert workflows for transcription, diarization, emotion tagging, and custom labeling, we help teams move from data sourcing to model-ready training and evaluation data efficiently and at scale.
Cambridge University Press is a not-for-profit publisher with a mission to unlock people’s potential with the best learning and research solutions. Visit our stand at the conference or go to Link Coming Soon to browse our latest book and journal publications and get a 30% discount.
We’re a community of scientists, product managers, engineers, industrial designers, and entertainment enthusiasts. Creating technologies that breathe life into the experiences and imaginations of fellow artists, filmmakers, musicians, and storytellers is our passion. We’ve spent the last 50 years transforming audiovisual experiences. Join us at the intersection of art and technology, and be part of our future.
Frontiers is an open-access publisher accelerating research discovery through rigorous peer review and innovative publishing. Frontiers in Signal Processing features high-impact work across signal processing theory, algorithms, and applications—audio, speech, imaging, communications, machine learning and more. Stop by our stand to meet the team and explore how we help researchers publish quickly, transparently, and reach a global audience.
LXT delivers industry-leading AI data solutions across the entire AI lifecycle. From data collection and annotation to evaluation and fine-tuning, we power GenAI, LLMs, and multimodal AI systems with scalable, high-quality data pipelines. With 8M+ vetted contributors in 150+ countries and 1,000+ language locales, LXT enables culturally aligned, domain-specific data delivery at scale. For sensitive AI initiatives, we offer secure handling via five ISO 27001–certified facilities. Engagement options include fully managed services or Crowd-as-a-Service (CaaS) for flexible integration with enterprise workflows.
Magic Data is a world-leading provider of high-quality dataset solutions, with a core team that has been deeply engaged in the AI conversational data field for nearly two decades. We specialize in providing high-quality, professional, and multimodal conversational training datasets and consulting services for enterprises and research institutions in the artificial intelligence domain.
MATLAB and Simulink are used as fundamental modeling and simulation tools for research and development wherever engineering and science is applied. More than 6,500 colleges and universities use MATLAB and Simulink for teaching and research. MathWorks products also help prepare students for careers in industry, where the tools are widely used for algorithm development, data analysis, visualization, and numeric computation.
With a portfolio of over 2,700 journals and over 220,000 books, Springer is a global leader in academic and scientific publishing. We empower authors to share impactful research, enable readers to access trusted content, and collaborate with institutions and communities to advance knowledge worldwide. Whether you’re publishing cutting-edge science or foundational texts, Springer provides the reach, credibility, and support to help your work make a lasting difference.
Voices is the trusted platform for performance-grade, actor-powered voice solutions — from professional voice over to Voice Data and Voice AI. For more than 20 years, enterprises including Microsoft, BMW, and Cisco have relied on Voices for talent, technology, and production expertise backed by exclusive brand voice licensing, documented consent, and a global network of professional voice actors across 185 countries. Learn more at voices.com.
Most TTS papers evaluate with fewer than 20 listeners using different sentences and scales – the results are incomparable. Voice Arena provides public TTS benchmarks and leaderboards, delivering reliable comparisons, deep performance insights, and standardized methodologies to help researchers and companies measure, improve, and understand voice systems across languages and real-world use cases. Submit your model. Prove your voice.
Wave Sciences delivers real-world targeted signal extraction in near- and far-field conditions, even at significantly negative SNR, using its patented, physics-based 3D acoustics AI engine. It enhances and isolates signals in dynamic environments with simultaneous speech, reverberation, and interposed talkers, enabling applications where conventional methods fail, including forensics, voice interfaces, hearing assistance, and transcription.