Release Date
January 15, 2026
As live streaming moves beyond simply being online toward being stable, visually polished, intelligent, and easy to operate, user experience is no longer defined by hardware alone. It depends on how vision, audio, edge AI, and interaction are rebuilt as one system.
At CES 2026, Union Image and Malanshan Audio Lab jointly showcased an AI binocular live streaming camera for professional content creation and real-time broadcasting.

A Device-Side Vision and Audio System
This product is not a simple combination of a camera and a microphone. It is designed around real creator workflows and integrates AI vision processing, binocular imaging, stabilization, gesture recognition, audio pickup, and interaction control.
High-performance visual processing helps maintain clear, smooth, and stable images under demanding live-streaming workloads, supporting reliable output for professional broadcasts and mobile content creation.

Binocular Imaging, Stabilization, and AI Interaction
The binocular camera system enables flexible switching between product display, close-up shots, and scene views. A three-axis mechanical gimbal combined with Union Image's algorithm helps compensate for movement, keeping the subject centered and the image stable.
AI gesture recognition allows creators to trigger shooting, recording, and live-streaming commands naturally. AI color styles and cinematic background blur help generate broadcast-ready visuals with a more professional look.

Audio Collaboration for Cleaner Live Content
With audio technology from Malanshan Audio Lab, the system uses a multi-microphone array and voice processing algorithms to capture sound from multiple directions, reduce environmental noise, and enhance vocal clarity.
The workflow supports one-click live streaming, pause control, camera and image-mode switching, fast audio and image adjustment, and multi-platform streaming, allowing creators to focus on content instead of device setup.

This CES co-exhibition shows how device-side AI vision and audio processing can become a stable, reusable capability for creators, smart hardware brands, and real-world content scenarios. Vision for All means making reliable visual intelligence accessible at scale.



