🧩 分享一些日常收集到的开源软件、开发工具和技术知识。
Useful open-source projects, dev tools, and tech snippets — shared from daily discoveries.
GPT-SoVITS-WebUI


Features:
Zero-shot TTS: Input a 5-second vocal sample and experience instant text-to-speech conversion.

Few-shot TTS: Fine-tune the model with just 1 minute of training data for improved voice similarity and realism.

Cross-lingual Support: Inference in languages different from the training dataset, currently supporting English, Japanese, Korean, Cantonese and Chinese.

WebUI Tools: Integrated tools include voice accompaniment separation, automatic training set segmentation, Chinese ASR, and text labeling, assisting beginners in creating training datasets and GPT/SoVITS models.

https://private-user-images.githubusercontent.com/129054828/297098117-05bee1fa-bdd8-4d85-9350-80c060ab47fb.mp4?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3NDYzMzI4ODYsIm5iZiI6MTc0NjMzMjU4NiwicGF0aCI6Ii8xMjkwNTQ4MjgvMjk3MDk4MTE3LTA1YmVlMWZhLWJkZDgtNGQ4NS05MzUwLTgwYzA2MGFiNDdmYi5tcDQ_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwNTA0JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDUwNFQwNDIzMDZaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1iN2E1NjFhNzcyYWYzMTcyNmVhM2E1N2NmYTQ1YTUzZWY4MDczODFkMzc4ODA0MjNhMzYzNGMwZWJjMGU3OWM4JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9._5KES7yV9cr_Hj76obZFPp2M746iMnZGvCQiTSl0agE

https://github.com/RVC-Boss/GPT-SoVITS/

#tts
 
 
Back to Top