Add a better text-to-audio (TTS, text-to-speech) setup. The current audio is auto-generated from text and sounds robotic and flat compared to the more advanced neural voices from the original/official LLMs.