Tag Topic

Digital Humans

This page collects content tagged “Digital Humans”, so you can keep reading along a single keyword or line of methods.

Published articles
3
StoryMaker: An Open-Source Tool for Generating Personalized Stories from Photos
StoryMaker is an open-source AI writing tool that generates story content from uploaded character photos, keeping each character's facial features, clothing, hairstyle, and body traits consistent with the photo. It suits novel writing, brand promotion, and game design, and it supports custom development for creators who want more personalized, vivid, and realistic content.
Deepgram Launches AI Voice Agent API: The Future of Real-Time Conversation
Deepgram's newly released AI Voice Agent API delivers seamless real-time voice conversations. Leveraging advanced speech recognition and generation models, the API supports real-time dialogue, pause and interruption handling, and flexible integration with various large language models. Its low latency and strong privacy safeguards make it suitable for scenarios such as customer support and medical transcription.
ElevenLabs Launches the New AI Voice Generation Tool Voice Design: Create Personalized Voices with Text Prompts
ElevenLabs has introduced Voice Design, an AI voice generation tool that lets users create personalized voices from text prompts. Users can tailor attributes such as age, accent, gender, and intonation, and can even design voices with mythic or sci-fi character traits. The tool is aimed at advertising, gaming, podcasts, and similar uses. Voice Design includes fine-tuning capabilities, integrates with ElevenLabs' text-to-speech platform, and will soon offer API access and real-time voice synthesis.