From 75f98e80c434bf2b14568c752837544a22d14096 Mon Sep 17 00:00:00 2001
From: mlbv <51232730+mlbv@users.noreply.github.com>
Date: Wed, 19 Jul 2023 04:55:19 +0800
Subject: [PATCH 1/6] Update README_zh_CN.md
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Align README_zh_CN.md with the style of the English version

---
 README_zh_CN.md | 14 ++++++++++----
 1 file changed, 10 insertions(+), 4 deletions(-)

diff --git a/README_zh_CN.md b/README_zh_CN.md
index af85e92..5324624 100644
--- a/README_zh_CN.md
+++ b/README_zh_CN.md
@@ -1,12 +1,19 @@
+
+
 # SoftVC VITS Singing Voice Conversion
 [**English**](./README.md) | [**中文简体**](./README_zh_CN.md)
-#### ✨ 带有 F0 曲线编辑器,角色混合时间轴编辑器的推理端 (Onnx 模型的用途) : [MoeVoiceStudio](https://github.com/NaruseMioShirakana/MoeVoiceStudio)
+[![在Google Cloab中打开](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/svc-develop-team/so-vits-svc/blob/4.1-Stable/sovits4_for_colab.ipynb)
+[![LICENSE](https://img.shields.io/badge/LICENSE-AGPL3.0-green.svg?style=for-the-badge)]([https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/LICENSE](https://github.com/svc-develop-team/so-vits-svc/blob/4.1-Stable/LICENSE))
-#### ✨ 改善了交互的一个分支推荐:[34j/so-vits-svc-fork](https://github.com/34j/so-vits-svc-fork)
+
-#### ✨ 支持实时转换的一个客户端:[w-okada/voice-changer](https://github.com/w-okada/voice-changer)
+> ✨ 带有 F0 曲线编辑器,角色混合时间轴编辑器的推理端 (Onnx 模型的用途): [MoeVoiceStudio](https://github.com/NaruseMioShirakana/MoeVoiceStudio)
+
+> ✨ 改善了交互的一个分支推荐: [34j/so-vits-svc-fork](https://github.com/34j/so-vits-svc-fork)
+
+> ✨ 支持实时转换的一个客户端: [w-okada/voice-changer](https://github.com/w-okada/voice-changer)
 **本项目与 Vits 有着根本上的不同。Vits 是 TTS,本项目是 SVC。本项目无法实现 TTS,Vits 也无法实现 SVC,这两个项目的模型是完全不通用的。**
@@ -412,7 +419,6 @@ python train_index.py -c configs/config.json
 + `inference_main.py`中指定`cluster_model_path` 为模型输出文件,留空则默认为`logs/44k/feature_and_index.pkl`
 + `inference_main.py`中指定`cluster_infer_ratio`,`0`为完全不使用特征检索,`1`为只使用特征检索,通常设置`0.5`即可
-### [![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/svc-develop-team/so-vits-svc/blob/4.1-Stable/sovits4_for_colab.ipynb) [sovits4_for_colab.ipynb](https://colab.research.google.com/github/svc-develop-team/so-vits-svc/blob/4.1-Stable/sovits4_for_colab.ipynb)
 ## 🗜️ 模型压缩

From db8673281d3802a095baf55d8477682e34004683 Mon Sep 17 00:00:00 2001
From: mlbv <51232730+mlbv@users.noreply.github.com>
Date: Wed, 19 Jul 2023 05:17:02 +0800
Subject: [PATCH 2/6] Update README_zh_CN.md

Fix LICENSE, the button should work normally now

---
 README_zh_CN.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README_zh_CN.md b/README_zh_CN.md
index 5324624..ebb52ac 100644
--- a/README_zh_CN.md
+++ b/README_zh_CN.md
@@ -5,7 +5,7 @@
 [**English**](./README.md) | [**中文简体**](./README_zh_CN.md)
 [![在Google Cloab中打开](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/svc-develop-team/so-vits-svc/blob/4.1-Stable/sovits4_for_colab.ipynb)
-[![LICENSE](https://img.shields.io/badge/LICENSE-AGPL3.0-green.svg?style=for-the-badge)]([https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/LICENSE](https://github.com/svc-develop-team/so-vits-svc/blob/4.1-Stable/LICENSE))
+[![LICENSE](https://img.shields.io/badge/LICENSE-AGPL3.0-green.svg?style=for-the-badge)](https://github.com/svc-develop-team/so-vits-svc/blob/4.1-Stable/LICENSE)

From 2fb389078e265efe1556cab9fd0e191ca9877098 Mon Sep 17 00:00:00 2001
From: mlbv <51232730+mlbv@users.noreply.github.com>
Date: Wed, 19 Jul 2023 05:18:05 +0800
Subject: [PATCH 3/6] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index a41f214..1cef5b5 100644
--- a/README.md
+++ b/README.md
@@ -5,7 +5,7 @@
 [**English**](./README.md) | [**中文简体**](./README_zh_CN.md)
 [![Open In Colab](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/svc-develop-team/so-vits-svc/blob/4.1-Stable/sovits4_for_colab.ipynb)
-[![Licence](https://img.shields.io/badge/LICENSE-AGPL3.0-green.svg?style=for-the-badge)]([https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/LICENSE](https://github.com/svc-develop-team/so-vits-svc/blob/4.1-Stable/LICENSE))
+[![Licence](https://img.shields.io/badge/LICENSE-AGPL3.0-green.svg?style=for-the-badge)](https://github.com/svc-develop-team/so-vits-svc/blob/4.1-Stable/LICENSE)

From 0e39e3f8896a96ad2964fa3e1b3dc3dbe233a5b1 Mon Sep 17 00:00:00 2001
From: mlbv <51232730+mlbv@users.noreply.github.com>
Date: Wed, 19 Jul 2023 19:28:15 +0800
Subject: [PATCH 4/6] Update losses.py

fix incorrect kl loss calculation

---
 modules/losses.py | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/modules/losses.py b/modules/losses.py
index 494e979..136df2f 100644
--- a/modules/losses.py
+++ b/modules/losses.py
@@ -50,9 +50,12 @@ def kl_loss(z_p, logs_q, m_p, logs_p, z_mask):
   m_p = m_p.float()
   logs_p = logs_p.float()
   z_mask = z_mask.float()
+  sigma = torch.exp(logs_p)
+  sigma = sigma * z_mask
   #print(logs_p)
   kl = logs_p - logs_q - 0.5
   kl += 0.5 * ((z_p - m_p)**2) * torch.exp(-2. * logs_p)
+  kl += 0.5 * torch.log(2 * torch.pi * sigma**2)
   kl = torch.sum(kl * z_mask)
   l = kl / torch.sum(z_mask)
   return l

From 107b2ce87bbcd7ee82d57c13a54675df6f89a580 Mon Sep 17 00:00:00 2001
From: YuriHead
Date: Fri, 21 Jul 2023 02:18:56 +0800
Subject: [PATCH 5/6] Update losses.py

---
 modules/losses.py | 3 ---
 1 file changed, 3 deletions(-)

diff --git a/modules/losses.py b/modules/losses.py
index 136df2f..494e979 100644
--- a/modules/losses.py
+++ b/modules/losses.py
@@ -50,12 +50,9 @@ def kl_loss(z_p, logs_q, m_p, logs_p, z_mask):
   m_p = m_p.float()
   logs_p = logs_p.float()
   z_mask = z_mask.float()
-  sigma = torch.exp(logs_p)
-  sigma = sigma * z_mask
   #print(logs_p)
   kl = logs_p - logs_q - 0.5
   kl += 0.5 * ((z_p - m_p)**2) * torch.exp(-2. * logs_p)
-  kl += 0.5 * torch.log(2 * torch.pi * sigma**2)
   kl = torch.sum(kl * z_mask)
   l = kl / torch.sum(z_mask)
   return l

From c50bb3b545467afe2cddd65fe5d415eb1d5c90bf Mon Sep 17 00:00:00 2001
From: YuriHead
Date: Fri, 21 Jul 2023 02:23:32 +0800
Subject: [PATCH 6/6] Update README_zh_CN.md

---
 README_zh_CN.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README_zh_CN.md b/README_zh_CN.md
index fec1f7e..9849091 100644
--- a/README_zh_CN.md
+++ b/README_zh_CN.md
@@ -10,11 +10,11 @@
-> ✨ 带有 F0 曲线编辑器,角色混合时间轴编辑器的推理端 (Onnx 模型的用途): [MoeVoiceStudio](https://github.com/NaruseMioShirakana/MoeVoiceStudio)
+#### ✨ 带有 F0 曲线编辑器,角色混合时间轴编辑器的推理端 (Onnx 模型的用途): [MoeVoiceStudio](https://github.com/NaruseMioShirakana/MoeVoiceStudio)
-> ✨ 改善了交互的一个分支推荐: [34j/so-vits-svc-fork](https://github.com/34j/so-vits-svc-fork)
+#### ✨ 改善了交互的一个分支推荐: [34j/so-vits-svc-fork](https://github.com/34j/so-vits-svc-fork)
-> ✨ 支持实时转换的一个客户端: [w-okada/voice-changer](https://github.com/w-okada/voice-changer)
+#### ✨ 支持实时转换的一个客户端: 
[w-okada/voice-changer](https://github.com/w-okada/voice-changer)
 **本项目与 Vits 有着根本上的不同。Vits 是 TTS,本项目是 SVC。本项目无法实现 TTS,Vits 也无法实现 SVC,这两个项目的模型是完全不通用的。**
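
Patches 4/6 and 5/6 add and then remove a `0.5 * torch.log(2 * torch.pi * sigma**2)` term in `kl_loss`, and neither commit message explains the reasoning. One way to compare the two versions is to check the per-element expression against the closed-form KL divergence between two one-dimensional Gaussians. The sketch below does that with plain Python floats rather than torch tensors. It assumes `z_p` is a sample drawn from a posterior N(m_q, exp(logs_q)^2); the names `m_q`, `kl_term`, `closed_form_kl`, `monte_carlo_kl`, and `patch4_extra_term` are illustrative and do not appear in the repository.

```python
import math
import random


def kl_term(z_p, logs_q, m_p, logs_p):
    """Per-element expression from kl_loss as left by patch 5/6 (no mask)."""
    return logs_p - logs_q - 0.5 + 0.5 * (z_p - m_p) ** 2 * math.exp(-2.0 * logs_p)


def closed_form_kl(m_q, logs_q, m_p, logs_p):
    """Closed-form KL(q || p) for q = N(m_q, s_q^2) and p = N(m_p, s_p^2)."""
    var_q = math.exp(2.0 * logs_q)
    var_p = math.exp(2.0 * logs_p)
    return logs_p - logs_q - 0.5 + 0.5 * (var_q + (m_q - m_p) ** 2) / var_p


def monte_carlo_kl(m_q, logs_q, m_p, logs_p, n=100_000, seed=0):
    """Average kl_term over n samples z ~ q (assumed sampling distribution)."""
    rng = random.Random(seed)
    sigma_q = math.exp(logs_q)
    total = 0.0
    for _ in range(n):
        z = rng.gauss(m_q, sigma_q)
        total += kl_term(z, logs_q, m_p, logs_p)
    return total / n


def patch4_extra_term(logs_p):
    """The term added by patch 4/6, ignoring the mask: 0.5 * log(2*pi*s_p^2)."""
    sigma = math.exp(logs_p)
    return 0.5 * math.log(2.0 * math.pi * sigma ** 2)


if __name__ == "__main__":
    # Arbitrary illustrative parameters: (m_q, logs_q, m_p, logs_p).
    params = (0.3, -0.2, -0.1, 0.4)
    print("closed form :", closed_form_kl(*params))
    print("sampled mean:", monte_carlo_kl(*params))
    print("extra term  :", patch4_extra_term(params[3]))
```

Under that sampling assumption, the sampled expression averages to the closed-form KL without any additional `2 * pi` term, which is consistent with the state restored by patch 5/6. Note also that patch 4/6 multiplies `sigma` by `z_mask` before taking the logarithm, so every position where the mask is 0 evaluates `log(0)`.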