Setsugesuka commited on
Commit
7b21803
·
verified ·
1 Parent(s): 9b9e950

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -3
README.md CHANGED
@@ -1,3 +1,54 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ - zh
6
+ - ja
7
+ tags:
8
+ - speech
9
+ - singing
10
+ - singing voice
11
+ - audio
12
+ - music
13
+ - vocoder
14
+ - codec
15
+ - pytorch
16
+ ---
17
+
18
+ ## Aliasing Free Neural Audio Synthesis
19
+
20
+ This is the official Hugging Face model repository for the paper **"[Aliasing Free Neural Audio Synthesis](TBD)"**, which is the first work to achieve efficient and straightforward aliasing-free upsampling-based neural audio generation in the entire field of Neural Vocoder & Codec.
21
+
22
+ For more details, please visit our [GitHub Repository](https://github.com/sizigi/AliasingFreeNeuralAudioSynthesis).
23
+
24
+ ## Model Checkpoints
25
+
26
+ This repository contains the following checkpoints:
27
+
28
+ | Model Name | Directory | Description |
29
+ | ----------------- | ---------------------------- | ------------------------------------------------- |
30
+ | **Pupu-Vocoder_Small** | `./pupuvocoder/*` | 14M parameter small version of Pupu-Vocoder. |
31
+ | **Pupu-Vocoder_Large** | `./pupuvocoder_large/*` | 122M parameter large version of Pupu-Vocoder. |
32
+ | **Pupu-Codec_Small** | `./pupucodec/*` | 32M parameter small version of Pupu-Codec. |
33
+ | **Pupu-Codec_Large** | `./pupucodec_large/*` | 119M parameter large version of Pupu-Codec. |
34
+
35
+ ## How to use
36
+
37
+ To run this model, you need to put the pretrained models in:
38
+
39
+ ```bash
40
+ AliasingNeuralAudioSynthesis/experiments
41
+ ```
42
+
43
+ of our official repository, and then follow the instructions written in the repository to resume, finetune, and inference our pretrained checkpoints.
44
+
45
+ ## Citation
46
+
47
+ ```bibtex
48
+ @article{afgen,
49
+ title = {Aliasing Free Neural Audio Synthesis},
50
+ author = {Yicheng Gu and Junan Zhang and Chaoren Wang and Jerry Li and Zhizheng Wu and Lauri Juvela},
51
+ year = {2025},
52
+ journal = {TBD},
53
+ }
54
+ ```