Voice Funtion like chatgpt
in progress
S
Seb Wichmann
Hi everyone,
It’s been nearly 12 months since this post first went up, and I just spoke with a Merlin customer service rep. They mentioned that the feature is still in the ‘planned’ stage. Given how long it’s been without much progress, I’m curious—does Merlin still expect to roll this out?
As a pro user with a lifetime subscription, I’ve been exploring other options like PI, Sesame, Groq, ChatGPT, Gemini, and Copilot, many of which already offer these features. I really love using Merlin and hope to see this feature released soon.
Thanks for any updates or insights!
TecEgg
Building on the voice functionality previously requested, I would like to propose an enhancement to enable multilingual voice output. It would be highly beneficial if the AI could not only process voice inputs in different languages but also respond with spoken output in multiple languages.
This feature would greatly enhance accessibility for users worldwide, allowing them to interact with the AI in their preferred language, both in text and voice formats. It would be particularly useful for language learners, professionals, and users who require bilingual or multilingual support in their workflows. In my case i would love to see "german" added to the list of possible languages (TTS)
endu
Merged in a post:
Real-time conversation
C
CS
To be able to start a conversation through the Merlin app or through the desktop site as Chatgpt does.
Vijay Bharadwaj
Merged in a post:
Ability for voice input on Web
P
Philippos Christoforou
so we can talk instead of typing
M
Mowd Chen
It seems realtime voice conversation has been remove from the latest update in iOS version 5.4.3.
M
Marlo
Voice input and output is a must in these days on mobile and also desktop, since ChatGPT and Gemini has is, I tend to use them, instead of Merlin and I am thinking about stopping the Merlin subscription.
It is way better and quick to interact with ai through voice
endu
Merged in a post:
Voice and music generation - e.g., ElevenLabs (voice)
G
Guy
Any chance of getting voice generation and music generation features, like image generation?
The best I know for voice generation being https://elevenlabs.io/
Merlin
in progress
cc: Ethan Cohen
Work is underway currently for the Merlin mobile app only.
Realtime API has high costs due to which implementation for prolonged use time is hindered. We're looking to keep it mobile only FOR NOW since the voice-talking experience is much better suited by design for hand-held devices (the way we envision it), and anything at the scale of our desktop apps would be unsustainable.
We're also figuring out a way to give users the ability to chat with voice with all text LLMs we offer on Merlin Chat. The quality of the experience is the bottleneck so far. Thanks!
Y
YM
Merlin . It makes me wonder if a lightweight opensource local llm is the answer for such a use case
ELVETH
Merlin the ability to ask thing (by voice) to Merlin while walking/driving is a must, today. What’s the release date/roadmap expectation?
Ethan Cohen
Hey team - do we have a timeline to release on this one for desktop? Feel like we're getting a lot of likes but haven't seen any movement. endu
G
Guy
endu Another thing for me is I would really like to be able to talk out e-mails, and then have it take that audio and make a clearer e-mail.
Or the same thing for notes/ideas - I speak out the note/idea, and it then make it clearer and organised.
Similar to what Letterly does - https://letterly.app/
Load More
→