Update README.md

This commit is contained in:
Miuzarte 2023-04-05 19:01:30 +08:00
parent 3a339a5eaf
commit 9cf6826e0d
2 changed files with 22 additions and 22 deletions

View File

@ -131,19 +131,19 @@ python inference_main.py -m "logs/44k/G_30400.pth" -c "configs/config.json" -n "
```
Required parameters:
- -m, --model_path: path to the model.
- -c, --config_path: path to the configuration file.
- -n, --clean_names: a list of wav file names located in the raw folder.
- -t, --trans: pitch adjustment, supports positive and negative (semitone) values.
- -s, --spk_list: target speaker name for synthesis.
- -cl, --clip: voice forced slicing, set to 0 to turn off(default), duration in seconds.
- `-m` | `--model_path`: path to the model.
- `-c` | `--config_path`: path to the configuration file.
- `-n` | `--clean_names`: a list of wav file names located in the raw folder.
- `-t` | `--trans`: pitch adjustment, supports positive and negative (semitone) values.
- `-s` | `--spk_list`: target speaker name for synthesis.
- `-cl` | `--clip`: voice forced slicing, set to 0 to turn off(default), duration in seconds.
Optional parameters: see the next section
- -lg, --linear_gradient: The cross fade length of two audio slices in seconds. If there is a discontinuous voice after forced slicing, you can adjust this value. Otherwise, it is recommended to use the default value of 0.
- -fmp, --f0_mean_pooling: Apply mean filter (pooling) to f0which may improve some hoarse sounds. Enabling this option will reduce inference speed.
- -a, --auto_predict_f0: automatic pitch prediction for voice conversion, do not enable this when converting songs as it can cause serious pitch issues.
- -cm, --cluster_model_path: path to the clustering model, fill in any value if clustering is not trained.
- -cr, --cluster_infer_ratio: proportion of the clustering solution, range 0-1, fill in 0 if the clustering model is not trained.
- `-lg` | `--linear_gradient`: The cross fade length of two audio slices in seconds. If there is a discontinuous voice after forced slicing, you can adjust this value. Otherwise, it is recommended to use the default value of 0.
- `-fmp` | `--f0_mean_pooling`: Apply mean filter (pooling) to f0which may improve some hoarse sounds. Enabling this option will reduce inference speed.
- `-a` | `--auto_predict_f0`: automatic pitch prediction for voice conversion, do not enable this when converting songs as it can cause serious pitch issues.
- `-cm` | `--cluster_model_path`: path to the clustering model, fill in any value if clustering is not trained.
- `-cr` | `--cluster_infer_ratio`: proportion of the clustering solution, range 0-1, fill in 0 if the clustering model is not trained.
## 🤔 Optional Settings

View File

@ -131,19 +131,19 @@ python inference_main.py -m "logs/44k/G_30400.pth" -c "configs/config.json" -n "
```
必填项部分
+ -m, --model_path:模型路径
+ -c, --config_path:配置文件路径
+ -n, --clean_nameswav 文件名列表,放在 raw 文件夹下
+ -t, --trans:音高调整,支持正负(半音)
+ -s, --spk_list:合成目标说话人名称
+ -cl, --clip音频强制切片默认0为自动切片单位为秒/s
+ `-m` | `--model_path`:模型路径
+ `-c` | `--config_path`:配置文件路径
+ `-n` | `--clean_names`wav 文件名列表,放在 raw 文件夹下
+ `-t` | `--trans`:音高调整,支持正负(半音)
+ `-s` | `--spk_list`:合成目标说话人名称
+ `-cl` | `--clip`音频强制切片默认0为自动切片单位为秒/s
可选项部分:部分具体见下一节
+ -lg, --linear_gradient两段音频切片的交叉淡入长度如果强制切片后出现人声不连贯可调整该数值如果连贯建议采用默认值0单位为秒
+ -fmp, --f0_mean_pooling是否对F0使用均值滤波器(池化),对部分哑音可能有改善。注意,启动该选项会导致推理速度下降,默认关闭
+ -a, --auto_predict_f0:语音转换自动预测音高,转换歌声时不要打开这个会严重跑调
+ -cm, --cluster_model_path:聚类模型路径,如果没有训练聚类则随便填
+ -cr, --cluster_infer_ratio聚类方案占比范围0-1若没有训练聚类模型则默认0即可
+ `-lg` | `--linear_gradient`两段音频切片的交叉淡入长度如果强制切片后出现人声不连贯可调整该数值如果连贯建议采用默认值0单位为秒
+ `-fmp` | `--f0_mean_pooling`是否对F0使用均值滤波器(池化),对部分哑音可能有改善。注意,启动该选项会导致推理速度下降,默认关闭
+ `-a` | `--auto_predict_f0`:语音转换自动预测音高,转换歌声时不要打开这个会严重跑调
+ `-cm` | `--cluster_model_path`:聚类模型路径,如果没有训练聚类则随便填
+ `-cr` | `--cluster_infer_ratio`聚类方案占比范围0-1若没有训练聚类模型则默认0即可
## 🤔 可选项