TEN

A tool to build real-time multimodal conversational AI agents.

Description

TEN is an open-source framework for building real-time multimodal conversational AI agents that can see, hear, and speak with users. Its modular architecture integrates large language models with speech recognition, text-to-speech, vision processing, and real-time communication. Developers can create agents with natural voice interaction, visual understanding, and animated avatars, and can swap AI components through plug-and-play extensions without code changes. TEN also offers a visual graph-based configuration system, supports real-time AI services such as Gemini 2.0 Live and OpenAI Realtime, and is compatible with platforms such as Dify and Coze. It is aimed at organizations that need low-latency, multimodal conversational agents and want the flexibility of open-source development together with production-grade performance.

GitHub Note

Note: This is a GitHub repository, meaning that it is code that someone created and made publicly available for anyone to use. Using it may require some knowledge of coding.

Tool Details
  • Pricing: GitHub

Similar Tools

AutoGPT (Hugging Face)

A Hugging Face Space for using AutoGPT.

EasyChat AI

Software for using ChatGPT on Windows.

Flezr

A no-code website builder that uses Google Sheets or Supabase to create dynamic, data-driven websites.

OverChat AI

A platform that unifies multiple AI models into one application for content creation and assistance tasks.

Wavve AI

A tool to convert spoken audio into structured, editable text summaries with tone customization options.

Caption My Photos

A tool to automate personalized caption generation for photos.