From 18f747d1310146207d155d2b1769e2ddb9823a2a Mon Sep 17 00:00:00 2001 From: YuriHead Date: Wed, 5 Apr 2023 19:55:08 +0800 Subject: [PATCH] Update README.md --- README.md | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/README.md b/README.md index 26c605d..0e9d4f0 100644 --- a/README.md +++ b/README.md @@ -144,6 +144,7 @@ Optional parameters: see the next section - `-a` | `--auto_predict_f0`: automatic pitch prediction for voice conversion, do not enable this when converting songs as it can cause serious pitch issues. - `-cm` | `--cluster_model_path`: path to the clustering model, fill in any value if clustering is not trained. - `-cr` | `--cluster_infer_ratio`: proportion of the clustering solution, range 0-1, fill in 0 if the clustering model is not trained. +- `-fmp` | `--f0_mean_pooling`: Whether the mean filter (pooling) is used for F0 to improve some matte sounds. Note that enabling this option causes reasoning to slow down. ## 🤔 Optional Settings @@ -167,6 +168,11 @@ The existing steps before clustering do not need to be changed. All you need to - Specify "cluster_model_path" in inference_main. - Specify "cluster_infer_ratio" in inference_main, where 0 means not using clustering at all, 1 means only using clustering, and usually 0.5 is sufficient. +### F0 mean filtering + +Introduction: The mean filtering of F0 can effectively reduce the hoarse sound caused by the predicted fluctuation of pitch (the hoarse sound caused by reverb or harmony can not be eliminated temporarily). This function has been greatly improved on some songs. If the song appears dumb after reasoning, it can be considered to open. +- Set f0_mean_pooling to true in inference_main + ### [![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1kv-3y2DmZo0uya8pEr1xk7cSB-4e_Pct?usp=sharing) [sovits4_for_colab.ipynb](https://colab.research.google.com/drive/1kv-3y2DmZo0uya8pEr1xk7cSB-4e_Pct?usp=sharing) #### [23/03/16] No longer need to download hubert manually