root commited on
Commit
242e4df
·
1 Parent(s): 089db8c

init commit

Browse files
LICENSE ADDED
@@ -0,0 +1,390 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Tencent is pleased to support the open source community by making SongPrep available.
2
+
3
+ Copyright (C) 2025 Tencent. All rights reserved. The below components in this distribution may have been modified by Tencent ("Tencent Modifications"). All Tencent Modifications are Copyright (C) Tencent.
4
+
5
+ SongPrep is licensed under the License Terms of SongPrep except for the third-party components listed below, which is licensed under different terms. SongPrep does not impose any additional limitations beyond what is outlined in the respective licenses of these third-party components. Users must comply with all terms and conditions of original licenses of these third-party components and must ensure that the usage of the third party components adheres to all relevant laws and regulations.
6
+
7
+ In case you believe there have been errors in the attribution below, you may submit the concerns to us for review and correction.
8
+
9
+ For avoidance of doubts, SongPrep refers to the inference-enabling code, parameters and weights made publicly available by Tencent in accordance with the License Terms of SongPrep.
10
+
11
+ License Terms of SongPrep:
12
+ --------------------------------------------------------------------
13
+ Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
14
+
15
+ - You agree to use the SongPrep only for academic purposes, and refrain from using it for any commercial or production purposes under any circumstances.
16
+
17
+ - The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
18
+
19
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
20
+
21
+
22
+ --------------------------------------------------------------------
23
+ The SongPrep parameters and weights were fine-tuned based on the following models licensed under Apache-2.0:
24
+
25
+ 1.Qwen2-7B
26
+ Copyright 2024 Alibaba Cloud.
27
+
28
+ Terms of the Apache-2.0:
29
+ --------------------------------------------------------------------
30
+ Apache License
31
+ Version 2.0, January 2004
32
+ http://www.apache.org/licenses/
33
+
34
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
35
+
36
+ 1. Definitions.
37
+
38
+ "License" shall mean the terms and conditions for use, reproduction,
39
+ and distribution as defined by Sections 1 through 9 of this document.
40
+
41
+ "Licensor" shall mean the copyright owner or entity authorized by
42
+ the copyright owner that is granting the License.
43
+
44
+ "Legal Entity" shall mean the union of the acting entity and all
45
+ other entities that control, are controlled by, or are under common
46
+ control with that entity. For the purposes of this definition,
47
+ "control" means (i) the power, direct or indirect, to cause the
48
+ direction or management of such entity, whether by contract or
49
+ otherwise, or (ii) ownership of fifty percent (50%) or more of the
50
+ outstanding shares, or (iii) beneficial ownership of such entity.
51
+
52
+ "You" (or "Your") shall mean an individual or Legal Entity
53
+ exercising permissions granted by this License.
54
+
55
+ "Source" form shall mean the preferred form for making modifications,
56
+ including but not limited to software source code, documentation
57
+ source, and configuration files.
58
+
59
+ "Object" form shall mean any form resulting from mechanical
60
+ transformation or translation of a Source form, including but
61
+ not limited to compiled object code, generated documentation,
62
+ and conversions to other media types.
63
+
64
+ "Work" shall mean the work of authorship, whether in Source or
65
+ Object form, made available under the License, as indicated by a
66
+ copyright notice that is included in or attached to the work
67
+ (an example is provided in the Appendix below).
68
+
69
+ "Derivative Works" shall mean any work, whether in Source or Object
70
+ form, that is based on (or derived from) the Work and for which the
71
+ editorial revisions, annotations, elaborations, or other modifications
72
+ represent, as a whole, an original work of authorship. For the purposes
73
+ of this License, Derivative Works shall not include works that remain
74
+ separable from, or merely link (or bind by name) to the interfaces of,
75
+ the Work and Derivative Works thereof.
76
+
77
+ "Contribution" shall mean any work of authorship, including
78
+ the original version of the Work and any modifications or additions
79
+ to that Work or Derivative Works thereof, that is intentionally
80
+ submitted to Licensor for inclusion in the Work by the copyright owner
81
+ or by an individual or Legal Entity authorized to submit on behalf of
82
+ the copyright owner. For the purposes of this definition, "submitted"
83
+ means any form of electronic, verbal, or written communication sent
84
+ to the Licensor or its representatives, including but not limited to
85
+ communication on electronic mailing lists, source code control systems,
86
+ and issue tracking systems that are managed by, or on behalf of, the
87
+ Licensor for the purpose of discussing and improving the Work, but
88
+ excluding communication that is conspicuously marked or otherwise
89
+ designated in writing by the copyright owner as "Not a Contribution."
90
+
91
+ "Contributor" shall mean Licensor and any individual or Legal Entity
92
+ on behalf of whom a Contribution has been received by Licensor and
93
+ subsequently incorporated within the Work.
94
+
95
+ 2. Grant of Copyright License. Subject to the terms and conditions of
96
+ this License, each Contributor hereby grants to You a perpetual,
97
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
98
+ copyright license to reproduce, prepare Derivative Works of,
99
+ publicly display, publicly perform, sublicense, and distribute the
100
+ Work and such Derivative Works in Source or Object form.
101
+
102
+ 3. Grant of Patent License. Subject to the terms and conditions of
103
+ this License, each Contributor hereby grants to You a perpetual,
104
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
105
+ (except as stated in this section) patent license to make, have made,
106
+ use, offer to sell, sell, import, and otherwise transfer the Work,
107
+ where such license applies only to those patent claims licensable
108
+ by such Contributor that are necessarily infringed by their
109
+ Contribution(s) alone or by combination of their Contribution(s)
110
+ with the Work to which such Contribution(s) was submitted. If You
111
+ institute patent litigation against any entity (including a
112
+ cross-claim or counterclaim in a lawsuit) alleging that the Work
113
+ or a Contribution incorporated within the Work constitutes direct
114
+ or contributory patent infringement, then any patent licenses
115
+ granted to You under this License for that Work shall terminate
116
+ as of the date such litigation is filed.
117
+
118
+ 4. Redistribution. You may reproduce and distribute copies of the
119
+ Work or Derivative Works thereof in any medium, with or without
120
+ modifications, and in Source or Object form, provided that You
121
+ meet the following conditions:
122
+
123
+ (a) You must give any other recipients of the Work or
124
+ Derivative Works a copy of this License; and
125
+
126
+ (b) You must cause any modified files to carry prominent notices
127
+ stating that You changed the files; and
128
+
129
+ (c) You must retain, in the Source form of any Derivative Works
130
+ that You distribute, all copyright, patent, trademark, and
131
+ attribution notices from the Source form of the Work,
132
+ excluding those notices that do not pertain to any part of
133
+ the Derivative Works; and
134
+
135
+ (d) If the Work includes a "NOTICE" text file as part of its
136
+ distribution, then any Derivative Works that You distribute must
137
+ include a readable copy of the attribution notices contained
138
+ within such NOTICE file, excluding those notices that do not
139
+ pertain to any part of the Derivative Works, in at least one
140
+ of the following places: within a NOTICE text file distributed
141
+ as part of the Derivative Works; within the Source form or
142
+ documentation, if provided along with the Derivative Works; or,
143
+ within a display generated by the Derivative Works, if and
144
+ wherever such third-party notices normally appear. The contents
145
+ of the NOTICE file are for informational purposes only and
146
+ do not modify the License. You may add Your own attribution
147
+ notices within Derivative Works that You distribute, alongside
148
+ or as an addendum to the NOTICE text from the Work, provided
149
+ that such additional attribution notices cannot be construed
150
+ as modifying the License.
151
+
152
+ You may add Your own copyright statement to Your modifications and
153
+ may provide additional or different license terms and conditions
154
+ for use, reproduction, or distribution of Your modifications, or
155
+ for any such Derivative Works as a whole, provided Your use,
156
+ reproduction, and distribution of the Work otherwise complies with
157
+ the conditions stated in this License.
158
+
159
+ 5. Submission of Contributions. Unless You explicitly state otherwise,
160
+ any Contribution intentionally submitted for inclusion in the Work
161
+ by You to the Licensor shall be under the terms and conditions of
162
+ this License, without any additional terms or conditions.
163
+ Notwithstanding the above, nothing herein shall supersede or modify
164
+ the terms of any separate license agreement you may have executed
165
+ with Licensor regarding such Contributions.
166
+
167
+ 6. Trademarks. This License does not grant permission to use the trade
168
+ names, trademarks, service marks, or product names of the Licensor,
169
+ except as required for reasonable and customary use in describing the
170
+ origin of the Work and reproducing the content of the NOTICE file.
171
+
172
+ 7. Disclaimer of Warranty. Unless required by applicable law or
173
+ agreed to in writing, Licensor provides the Work (and each
174
+ Contributor provides its Contributions) on an "AS IS" BASIS,
175
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
176
+ implied, including, without limitation, any warranties or conditions
177
+ of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
178
+ PARTICULAR PURPOSE. You are solely responsible for determining the
179
+ appropriateness of using or redistributing the Work and assume any
180
+ risks associated with Your exercise of permissions under this License.
181
+
182
+ 8. Limitation of Liability. In no event and under no legal theory,
183
+ whether in tort (including negligence), contract, or otherwise,
184
+ unless required by applicable law (such as deliberate and grossly
185
+ negligent acts) or agreed to in writing, shall any Contributor be
186
+ liable to You for damages, including any direct, indirect, special,
187
+ incidental, or consequential damages of any character arising as a
188
+ result of this License or out of the use or inability to use the
189
+ Work (including but not limited to damages for loss of goodwill,
190
+ work stoppage, computer failure or malfunction, or any and all
191
+ other commercial damages or losses), even if such Contributor
192
+ has been advised of the possibility of such damages.
193
+
194
+ 9. Accepting Warranty or Additional Liability. While redistributing
195
+ the Work or Derivative Works thereof, You may choose to offer,
196
+ and charge a fee for, acceptance of support, warranty, indemnity,
197
+ or other liability obligations and/or rights consistent with this
198
+ License. However, in accepting such obligations, You may act only
199
+ on Your own behalf and on Your sole responsibility, not on behalf
200
+ of any other Contributor, and only if You agree to indemnify,
201
+ defend, and hold each Contributor harmless for any liability
202
+ incurred by, or claims asserted against, such Contributor by reason
203
+ of your accepting any such warranty or additional liability.
204
+
205
+ END OF TERMS AND CONDITIONS
206
+
207
+ APPENDIX: How to apply the Apache License to your work.
208
+
209
+ To apply the Apache License to your work, attach the following
210
+ boilerplate notice, with the fields enclosed by brackets "[]"
211
+ replaced with your own identifying information. (Don't include
212
+ the brackets!) The text should be enclosed in the appropriate
213
+ comment syntax for the file format. We also recommend that a
214
+ file or class name and description of purpose be included on the
215
+ same "printed page" as the copyright notice for easier
216
+ identification within third-party archives.
217
+ Copyright 2024 Alibaba Cloud
218
+ Licensed under the Apache License, Version 2.0 (the "License");
219
+ you may not use this file except in compliance with the License.
220
+ You may obtain a copy of the License at
221
+ http://www.apache.org/licenses/LICENSE-2.0
222
+ Unless required by applicable law or agreed to in writing, software
223
+ distributed under the License is distributed on an "AS IS" BASIS,
224
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
225
+ See the License for the specific language governing permissions and
226
+ limitations under the License.
227
+
228
+ --------------------------------------------------------------------
229
+ The SongPrep is built on and with the aid of the following open source projects. Credits are given to these projects.
230
+
231
+ Open Source Software Licensed under the Apache-2.0:
232
+ --------------------------------------------------------------------
233
+ 1. huggingface-hub
234
+ Copyright 2023 The HuggingFace Team. All rights reserved.
235
+
236
+ 2. transformers
237
+ Copyright 2018- The Hugging Face team. All rights reserved.
238
+
239
+ Terms of the Apache-2.0:
240
+ --------------------------------------------------------------------
241
+ Apache License
242
+ Version 2.0, January 2004
243
+ http://www.apache.org/licenses/
244
+
245
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
246
+
247
+ 1. Definitions.
248
+
249
+ "License" shall mean the terms and conditions for use, reproduction, and distribution as defined by Sections 1 through 9 of this document.
250
+
251
+ "Licensor" shall mean the copyright owner or entity authorized by the copyright owner that is granting the License.
252
+
253
+ "Legal Entity" shall mean the union of the acting entity and all other entities that control, are controlled by, or are under common control with that entity. For the purposes of this definition, "control" means (i) the power, direct or indirect, to cause the direction or management of such entity, whether by contract or otherwise, or (ii) ownership of fifty percent (50%) or more of the outstanding shares, or (iii) beneficial ownership of such entity.
254
+
255
+ "You" (or "Your") shall mean an individual or Legal Entity exercising permissions granted by this License.
256
+
257
+ "Source" form shall mean the preferred form for making modifications, including but not limited to software source code, documentation source, and configuration files.
258
+
259
+ "Object" form shall mean any form resulting from mechanical transformation or translation of a Source form, including but not limited to compiled object code, generated documentation, and conversions to other media types.
260
+
261
+ "Work" shall mean the work of authorship, whether in Source or Object form, made available under the License, as indicated by a copyright notice that is included in or attached to the work (an example is provided in the Appendix below).
262
+
263
+ "Derivative Works" shall mean any work, whether in Source or Object form, that is based on (or derived from) the Work and for which the editorial revisions, annotations, elaborations, or other modifications represent, as a whole, an original work of authorship. For the purposes of this License, Derivative Works shall not include works that remain separable from, or merely link (or bind by name) to the interfaces of, the Work and Derivative Works thereof.
264
+
265
+ "Contribution" shall mean any work of authorship, including the original version of the Work and any modifications or additions to that Work or Derivative Works thereof, that is intentionally submitted to Licensor for inclusion in the Work by the copyright owner or by an individual or Legal Entity authorized to submit on behalf of the copyright owner. For the purposes of this definition, "submitted" means any form of electronic, verbal, or written communication sent to the Licensor or its representatives, including but not limited to communication on electronic mailing lists, source code control systems, and issue tracking systems that are managed by, or on behalf of, the Licensor for the purpose of discussing and improving the Work, but excluding communication that is conspicuously marked or otherwise designated in writing by the copyright owner as "Not a Contribution."
266
+
267
+ "Contributor" shall mean Licensor and any individual or Legal Entity on behalf of whom a Contribution has been received by Licensor and subsequently incorporated within the Work.
268
+
269
+ 2. Grant of Copyright License. Subject to the terms and conditions of this License, each Contributor hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable copyright license to reproduce, prepare Derivative Works of, publicly display, publicly perform, sublicense, and distribute the Work and such Derivative Works in Source or Object form.
270
+
271
+ 3. Grant of Patent License. Subject to the terms and conditions of this License, each Contributor hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable (except as stated in this section) patent license to make, have made, use, offer to sell, sell, import, and otherwise transfer the Work, where such license applies only to those patent claims licensable by such Contributor that are necessarily infringed by their Contribution(s) alone or by combination of their Contribution(s) with the Work to which such Contribution(s) was submitted. If You institute patent litigation against any entity (including a cross-claim or counterclaim in a lawsuit) alleging that the Work or a Contribution incorporated within the Work constitutes direct or contributory patent infringement, then any patent licenses granted to You under this License for that Work shall terminate as of the date such litigation is filed.
272
+
273
+ 4. Redistribution. You may reproduce and distribute copies of the Work or Derivative Works thereof in any medium, with or without modifications, and in Source or Object form, provided that You meet the following conditions:
274
+
275
+ You must give any other recipients of the Work or Derivative Works a copy of this License; and
276
+ You must cause any modified files to carry prominent notices stating that You changed the files; and
277
+ You must retain, in the Source form of any Derivative Works that You distribute, all copyright, patent, trademark, and attribution notices from the Source form of the Work, excluding those notices that do not pertain to any part of the Derivative Works; and
278
+ If the Work includes a "NOTICE" text file as part of its distribution, then any Derivative Works that You distribute must include a readable copy of the attribution notices contained within such NOTICE file, excluding those notices that do not pertain to any part of the Derivative Works, in at least one of the following places: within a NOTICE text file distributed as part of the Derivative Works; within the Source form or documentation, if provided along with the Derivative Works; or, within a display generated by the Derivative Works, if and wherever such third-party notices normally appear. The contents of the NOTICE file are for informational purposes only and do not modify the License. You may add Your own attribution notices within Derivative Works that You distribute, alongside or as an addendum to the NOTICE text from the Work, provided that such additional attribution notices cannot be construed as modifying the License.
279
+ You may add Your own copyright statement to Your modifications and may provide additional or different license terms and conditions for use, reproduction, or distribution of Your modifications, or for any such Derivative Works as a whole, provided Your use, reproduction, and distribution of the Work otherwise complies with the conditions stated in this License.
280
+
281
+ 5. Submission of Contributions. Unless You explicitly state otherwise, any Contribution intentionally submitted for inclusion in the Work by You to the Licensor shall be under the terms and conditions of this License, without any additional terms or conditions. Notwithstanding the above, nothing herein shall supersede or modify the terms of any separate license agreement you may have executed with Licensor regarding such Contributions.
282
+
283
+ 6. Trademarks. This License does not grant permission to use the trade names, trademarks, service marks, or product names of the Licensor, except as required for reasonable and customary use in describing the origin of the Work and reproducing the content of the NOTICE file.
284
+
285
+ 7. Disclaimer of Warranty. Unless required by applicable law or agreed to in writing, Licensor provides the Work (and each Contributor provides its Contributions) on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied, including, without limitation, any warranties or conditions of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A PARTICULAR PURPOSE. You are solely responsible for determining the appropriateness of using or redistributing the Work and assume any risks associated with Your exercise of permissions under this License.
286
+
287
+ 8. Limitation of Liability. In no event and under no legal theory, whether in tort (including negligence), contract, or otherwise, unless required by applicable law (such as deliberate and grossly negligent acts) or agreed to in writing, shall any Contributor be liable to You for damages, including any direct, indirect, special, incidental, or consequential damages of any character arising as a result of this License or out of the use or inability to use the Work (including but not limited to damages for loss of goodwill, work stoppage, computer failure or malfunction, or any and all other commercial damages or losses), even if such Contributor has been advised of the possibility of such damages.
288
+
289
+ 9. Accepting Warranty or Additional Liability. While redistributing the Work or Derivative Works thereof, You may choose to offer, and charge a fee for, acceptance of support, warranty, indemnity, or other liability obligations and/or rights consistent with this License. However, in accepting such obligations, You may act only on Your own behalf and on Your sole responsibility, not on behalf of any other Contributor, and only if You agree to indemnify, defend, and hold each Contributor harmless for any liability incurred by, or claims asserted against, such Contributor by reason of your accepting any such warranty or additional liability.
290
+
291
+ END OF TERMS AND CONDITIONS
292
+
293
+
294
+
295
+
296
+
297
+ Open Source Software Licensed under the BSD-2-Clause:
298
+ --------------------------------------------------------------------
299
+ 1. torchaudio
300
+ Copyright (c) 2017 Facebook Inc. (Soumith Chintala),
301
+ All rights reserved.
302
+ Terms of the BSD-2-Clause:
303
+ --------------------------------------------------------------------
304
+
305
+ Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
306
+
307
+ 1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
308
+ 2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
309
+ THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
310
+
311
+
312
+
313
+
314
+
315
+ Open Source Software Licensed under the BSD-3-Clause:
316
+ --------------------------------------------------------------------
317
+ 1. torch
318
+ From PyTorch:
319
+
320
+ Copyright (c) 2016- Facebook, Inc (Adam Paszke)
321
+ Copyright (c) 2014- Facebook, Inc (Soumith Chintala)
322
+ Copyright (c) 2011-2014 Idiap Research Institute (Ronan Collobert)
323
+ Copyright (c) 2012-2014 Deepmind Technologies (Koray Kavukcuoglu)
324
+ Copyright (c) 2011-2012 NEC Laboratories America (Koray Kavukcuoglu)
325
+ Copyright (c) 2011-2013 NYU (Clement Farabet)
326
+ Copyright (c) 2006-2010 NEC Laboratories America (Ronan Collobert, Leon Bottou, Iain Melvin, Jason Weston)
327
+ Copyright (c) 2006 Idiap Research Institute (Samy Bengio)
328
+ Copyright (c) 2001-2004 Idiap Research Institute (Ronan Collobert, Samy Bengio, Johnny Mariethoz)
329
+
330
+ From Caffe2:
331
+
332
+ Copyright (c) 2016-present, Facebook Inc. All rights reserved.
333
+
334
+ All contributions by Facebook:
335
+ Copyright (c) 2016 Facebook Inc.
336
+
337
+ All contributions by Google:
338
+ Copyright (c) 2015 Google Inc.
339
+ All rights reserved.
340
+
341
+ All contributions by Yangqing Jia:
342
+ Copyright (c) 2015 Yangqing Jia
343
+ All rights reserved.
344
+
345
+ All contributions by Kakao Brain:
346
+ Copyright 2019-2020 Kakao Brain
347
+
348
+ All contributions by Cruise LLC:
349
+ Copyright (c) 2022 Cruise LLC.
350
+ All rights reserved.
351
+
352
+ All contributions from Caffe:
353
+ Copyright(c) 2013, 2014, 2015, the respective contributors
354
+ All rights reserved.
355
+
356
+ All other contributions:
357
+ Copyright(c) 2015, 2016 the respective contributors
358
+ All rights reserved.
359
+
360
+ Terms of the BSD-3-Clause:
361
+ --------------------------------------------------------------------
362
+ BSD 3-Clause License
363
+
364
+ Redistribution and use in source and binary forms, with or without
365
+ modification, are permitted provided that the following conditions are met:
366
+
367
+ 1. Redistributions of source code must retain the above copyright notice, this
368
+ list of conditions and the following disclaimer.
369
+
370
+ 2. Redistributions in binary form must reproduce the above copyright notice,
371
+ this list of conditions and the following disclaimer in the documentation
372
+ and/or other materials provided with the distribution.
373
+
374
+ 3. Neither the name of the copyright holder nor the names of its
375
+ contributors may be used to endorse or promote products derived from
376
+ this software without specific prior written permission.
377
+
378
+ THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
379
+ AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
380
+ IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
381
+ DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE
382
+ FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
383
+ DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
384
+ SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
385
+ CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
386
+ OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
387
+ OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
388
+
389
+ ==================================================
390
+ End of the Attribution Notice of this project.
README.md CHANGED
@@ -1,3 +1,42 @@
1
  ---
2
- license: apache-2.0
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - zh
4
+ - en
5
+ pipeline_tag: automatic-speech-recognition
6
  ---
7
+
8
+ # SongPrep
9
+
10
+ <p align="center"><img src="img/logo.jpg" width="40%"></p>
11
+ <p align="center">
12
+ <a href="https://song-prep.github.io/demo/">Demo</a> &nbsp;|&nbsp; <a href="https://arxiv.org/abs/2509.17404">Paper</a> &nbsp;|&nbsp; <a href="https://github.com/tencent-ailab/songprep">Code</a> &nbsp;|&nbsp; <a href="https://huggingface.co/datasets/waytan22/SSLD-200">Dataset</a>
13
+ </p>
14
+
15
+ This repository is the official weight repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Transcription. In this repository, we provide the SongPrep-7B model that has been trained on the Million Song Dataset.
16
+
17
+
18
+ ## Model Versions
19
+
20
+ | Model | #Params | HuggingFace |
21
+ | :----------------------: | :----------------------: | :---------------------------------------------------: |
22
+ | SongPrep | 7B |you are here |
23
+
24
+
25
+ ## Citation
26
+
27
+ ```
28
+ @misc{tan2025songpreppreprocessingframeworkendtoend,
29
+ title={SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Transcription},
30
+ author={Wei Tan and Shun Lei and Huaicheng Zhang and Guangzheng Li and Yixuan Zhang and Hangting Chen and Jianwei Yu and Rongzhi Gu and Dong Yu},
31
+ year={2025},
32
+ eprint={2509.17404},
33
+ archivePrefix={arXiv},
34
+ primaryClass={eess.AS},
35
+ url={https://arxiv.org/abs/2509.17404},
36
+ }
37
+ ```
38
+
39
+
40
+ ## License
41
+
42
+ The code and weights in this repository is released in the [LICENSE](LICENSE) file.
config.json ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "Qwen2ForCausalLM"
4
+ ],
5
+ "attention_dropout": 0.0,
6
+ "bos_token_id": 151643,
7
+ "eos_token_id": 151643,
8
+ "hidden_act": "silu",
9
+ "hidden_size": 3584,
10
+ "initializer_range": 0.02,
11
+ "intermediate_size": 18944,
12
+ "max_position_embeddings": 131072,
13
+ "max_window_layers": 28,
14
+ "model_type": "qwen2",
15
+ "num_attention_heads": 28,
16
+ "num_hidden_layers": 28,
17
+ "num_key_value_heads": 4,
18
+ "rms_norm_eps": 1e-06,
19
+ "rope_theta": 1000000.0,
20
+ "sliding_window": 131072,
21
+ "tie_word_embeddings": false,
22
+ "torch_dtype": "bfloat16",
23
+ "transformers_version": "4.37.2",
24
+ "use_cache": true,
25
+ "use_sliding_window": false,
26
+ "vocab_size": 168040
27
+ }
img/logo.jpg ADDED
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
model-00001-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9e09c5aaeca4df316a6da1979a549da296fa718b58408d3c7ac9020986ff39fa
3
+ size 9924927776
model-00002-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c597ea004ab2b0a229c23695bd81fae3983234a68a68536875194288726e4fee
3
+ size 5535376064
model.safetensors.index.json ADDED
@@ -0,0 +1,346 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "metadata": {
3
+ "total_size": 15460264960
4
+ },
5
+ "weight_map": {
6
+ "lm_head.weight": "model-00002-of-00002.safetensors",
7
+ "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
8
+ "model.layers.0.input_layernorm.weight": "model-00001-of-00002.safetensors",
9
+ "model.layers.0.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
10
+ "model.layers.0.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
11
+ "model.layers.0.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
12
+ "model.layers.0.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
13
+ "model.layers.0.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
14
+ "model.layers.0.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
15
+ "model.layers.0.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
16
+ "model.layers.0.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
17
+ "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
18
+ "model.layers.0.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
19
+ "model.layers.0.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
20
+ "model.layers.1.input_layernorm.weight": "model-00001-of-00002.safetensors",
21
+ "model.layers.1.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
22
+ "model.layers.1.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
23
+ "model.layers.1.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
24
+ "model.layers.1.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
25
+ "model.layers.1.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
26
+ "model.layers.1.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
27
+ "model.layers.1.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
28
+ "model.layers.1.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
29
+ "model.layers.1.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
30
+ "model.layers.1.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
31
+ "model.layers.1.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
32
+ "model.layers.10.input_layernorm.weight": "model-00001-of-00002.safetensors",
33
+ "model.layers.10.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
34
+ "model.layers.10.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
35
+ "model.layers.10.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
36
+ "model.layers.10.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
37
+ "model.layers.10.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
38
+ "model.layers.10.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
39
+ "model.layers.10.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
40
+ "model.layers.10.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
41
+ "model.layers.10.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
42
+ "model.layers.10.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
43
+ "model.layers.10.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
44
+ "model.layers.11.input_layernorm.weight": "model-00001-of-00002.safetensors",
45
+ "model.layers.11.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
46
+ "model.layers.11.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
47
+ "model.layers.11.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
48
+ "model.layers.11.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
49
+ "model.layers.11.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
50
+ "model.layers.11.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
51
+ "model.layers.11.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
52
+ "model.layers.11.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
53
+ "model.layers.11.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
54
+ "model.layers.11.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
55
+ "model.layers.11.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
56
+ "model.layers.12.input_layernorm.weight": "model-00001-of-00002.safetensors",
57
+ "model.layers.12.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
58
+ "model.layers.12.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
59
+ "model.layers.12.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
60
+ "model.layers.12.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
61
+ "model.layers.12.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
62
+ "model.layers.12.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
63
+ "model.layers.12.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
64
+ "model.layers.12.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
65
+ "model.layers.12.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
66
+ "model.layers.12.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
67
+ "model.layers.12.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
68
+ "model.layers.13.input_layernorm.weight": "model-00001-of-00002.safetensors",
69
+ "model.layers.13.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
70
+ "model.layers.13.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
71
+ "model.layers.13.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
72
+ "model.layers.13.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
73
+ "model.layers.13.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
74
+ "model.layers.13.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
75
+ "model.layers.13.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
76
+ "model.layers.13.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
77
+ "model.layers.13.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
78
+ "model.layers.13.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
79
+ "model.layers.13.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
80
+ "model.layers.14.input_layernorm.weight": "model-00001-of-00002.safetensors",
81
+ "model.layers.14.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
82
+ "model.layers.14.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
83
+ "model.layers.14.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
84
+ "model.layers.14.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
85
+ "model.layers.14.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
86
+ "model.layers.14.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
87
+ "model.layers.14.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
88
+ "model.layers.14.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
89
+ "model.layers.14.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
90
+ "model.layers.14.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
91
+ "model.layers.14.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
92
+ "model.layers.15.input_layernorm.weight": "model-00001-of-00002.safetensors",
93
+ "model.layers.15.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
94
+ "model.layers.15.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
95
+ "model.layers.15.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
96
+ "model.layers.15.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
97
+ "model.layers.15.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
98
+ "model.layers.15.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
99
+ "model.layers.15.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
100
+ "model.layers.15.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
101
+ "model.layers.15.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
102
+ "model.layers.15.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
103
+ "model.layers.15.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
104
+ "model.layers.16.input_layernorm.weight": "model-00001-of-00002.safetensors",
105
+ "model.layers.16.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
106
+ "model.layers.16.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
107
+ "model.layers.16.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
108
+ "model.layers.16.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
109
+ "model.layers.16.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
110
+ "model.layers.16.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
111
+ "model.layers.16.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
112
+ "model.layers.16.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
113
+ "model.layers.16.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
114
+ "model.layers.16.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
115
+ "model.layers.16.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
116
+ "model.layers.17.input_layernorm.weight": "model-00001-of-00002.safetensors",
117
+ "model.layers.17.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
118
+ "model.layers.17.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
119
+ "model.layers.17.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
120
+ "model.layers.17.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
121
+ "model.layers.17.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
122
+ "model.layers.17.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
123
+ "model.layers.17.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
124
+ "model.layers.17.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
125
+ "model.layers.17.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
126
+ "model.layers.17.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
127
+ "model.layers.17.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
128
+ "model.layers.18.input_layernorm.weight": "model-00002-of-00002.safetensors",
129
+ "model.layers.18.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
130
+ "model.layers.18.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
131
+ "model.layers.18.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
132
+ "model.layers.18.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
133
+ "model.layers.18.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
134
+ "model.layers.18.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
135
+ "model.layers.18.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
136
+ "model.layers.18.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
137
+ "model.layers.18.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
138
+ "model.layers.18.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
139
+ "model.layers.18.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
140
+ "model.layers.19.input_layernorm.weight": "model-00002-of-00002.safetensors",
141
+ "model.layers.19.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
142
+ "model.layers.19.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
143
+ "model.layers.19.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
144
+ "model.layers.19.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
145
+ "model.layers.19.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
146
+ "model.layers.19.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
147
+ "model.layers.19.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
148
+ "model.layers.19.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
149
+ "model.layers.19.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
150
+ "model.layers.19.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
151
+ "model.layers.19.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
152
+ "model.layers.2.input_layernorm.weight": "model-00001-of-00002.safetensors",
153
+ "model.layers.2.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
154
+ "model.layers.2.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
155
+ "model.layers.2.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
156
+ "model.layers.2.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
157
+ "model.layers.2.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
158
+ "model.layers.2.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
159
+ "model.layers.2.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
160
+ "model.layers.2.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
161
+ "model.layers.2.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
162
+ "model.layers.2.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
163
+ "model.layers.2.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
164
+ "model.layers.20.input_layernorm.weight": "model-00002-of-00002.safetensors",
165
+ "model.layers.20.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
166
+ "model.layers.20.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
167
+ "model.layers.20.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
168
+ "model.layers.20.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
169
+ "model.layers.20.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
170
+ "model.layers.20.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
171
+ "model.layers.20.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
172
+ "model.layers.20.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
173
+ "model.layers.20.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
174
+ "model.layers.20.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
175
+ "model.layers.20.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
176
+ "model.layers.21.input_layernorm.weight": "model-00002-of-00002.safetensors",
177
+ "model.layers.21.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
178
+ "model.layers.21.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
179
+ "model.layers.21.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
180
+ "model.layers.21.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
181
+ "model.layers.21.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
182
+ "model.layers.21.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
183
+ "model.layers.21.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
184
+ "model.layers.21.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
185
+ "model.layers.21.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
186
+ "model.layers.21.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
187
+ "model.layers.21.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
188
+ "model.layers.22.input_layernorm.weight": "model-00002-of-00002.safetensors",
189
+ "model.layers.22.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
190
+ "model.layers.22.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
191
+ "model.layers.22.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
192
+ "model.layers.22.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
193
+ "model.layers.22.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
194
+ "model.layers.22.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
195
+ "model.layers.22.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
196
+ "model.layers.22.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
197
+ "model.layers.22.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
198
+ "model.layers.22.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
199
+ "model.layers.22.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
200
+ "model.layers.23.input_layernorm.weight": "model-00002-of-00002.safetensors",
201
+ "model.layers.23.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
202
+ "model.layers.23.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
203
+ "model.layers.23.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
204
+ "model.layers.23.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
205
+ "model.layers.23.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
206
+ "model.layers.23.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
207
+ "model.layers.23.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
208
+ "model.layers.23.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
209
+ "model.layers.23.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
210
+ "model.layers.23.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
211
+ "model.layers.23.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
212
+ "model.layers.24.input_layernorm.weight": "model-00002-of-00002.safetensors",
213
+ "model.layers.24.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
214
+ "model.layers.24.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
215
+ "model.layers.24.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
216
+ "model.layers.24.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
217
+ "model.layers.24.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
218
+ "model.layers.24.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
219
+ "model.layers.24.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
220
+ "model.layers.24.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
221
+ "model.layers.24.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
222
+ "model.layers.24.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
223
+ "model.layers.24.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
224
+ "model.layers.25.input_layernorm.weight": "model-00002-of-00002.safetensors",
225
+ "model.layers.25.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
226
+ "model.layers.25.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
227
+ "model.layers.25.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
228
+ "model.layers.25.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
229
+ "model.layers.25.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
230
+ "model.layers.25.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
231
+ "model.layers.25.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
232
+ "model.layers.25.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
233
+ "model.layers.25.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
234
+ "model.layers.25.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
235
+ "model.layers.25.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
236
+ "model.layers.26.input_layernorm.weight": "model-00002-of-00002.safetensors",
237
+ "model.layers.26.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
238
+ "model.layers.26.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
239
+ "model.layers.26.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
240
+ "model.layers.26.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
241
+ "model.layers.26.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
242
+ "model.layers.26.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
243
+ "model.layers.26.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
244
+ "model.layers.26.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
245
+ "model.layers.26.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
246
+ "model.layers.26.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
247
+ "model.layers.26.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
248
+ "model.layers.27.input_layernorm.weight": "model-00002-of-00002.safetensors",
249
+ "model.layers.27.mlp.down_proj.weight": "model-00002-of-00002.safetensors",
250
+ "model.layers.27.mlp.gate_proj.weight": "model-00002-of-00002.safetensors",
251
+ "model.layers.27.mlp.up_proj.weight": "model-00002-of-00002.safetensors",
252
+ "model.layers.27.post_attention_layernorm.weight": "model-00002-of-00002.safetensors",
253
+ "model.layers.27.self_attn.k_proj.bias": "model-00002-of-00002.safetensors",
254
+ "model.layers.27.self_attn.k_proj.weight": "model-00002-of-00002.safetensors",
255
+ "model.layers.27.self_attn.o_proj.weight": "model-00002-of-00002.safetensors",
256
+ "model.layers.27.self_attn.q_proj.bias": "model-00002-of-00002.safetensors",
257
+ "model.layers.27.self_attn.q_proj.weight": "model-00002-of-00002.safetensors",
258
+ "model.layers.27.self_attn.v_proj.bias": "model-00002-of-00002.safetensors",
259
+ "model.layers.27.self_attn.v_proj.weight": "model-00002-of-00002.safetensors",
260
+ "model.layers.3.input_layernorm.weight": "model-00001-of-00002.safetensors",
261
+ "model.layers.3.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
262
+ "model.layers.3.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
263
+ "model.layers.3.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
264
+ "model.layers.3.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
265
+ "model.layers.3.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
266
+ "model.layers.3.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
267
+ "model.layers.3.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
268
+ "model.layers.3.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
269
+ "model.layers.3.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
270
+ "model.layers.3.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
271
+ "model.layers.3.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
272
+ "model.layers.4.input_layernorm.weight": "model-00001-of-00002.safetensors",
273
+ "model.layers.4.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
274
+ "model.layers.4.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
275
+ "model.layers.4.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
276
+ "model.layers.4.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
277
+ "model.layers.4.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
278
+ "model.layers.4.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
279
+ "model.layers.4.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
280
+ "model.layers.4.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
281
+ "model.layers.4.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
282
+ "model.layers.4.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
283
+ "model.layers.4.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
284
+ "model.layers.5.input_layernorm.weight": "model-00001-of-00002.safetensors",
285
+ "model.layers.5.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
286
+ "model.layers.5.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
287
+ "model.layers.5.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
288
+ "model.layers.5.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
289
+ "model.layers.5.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
290
+ "model.layers.5.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
291
+ "model.layers.5.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
292
+ "model.layers.5.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
293
+ "model.layers.5.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
294
+ "model.layers.5.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
295
+ "model.layers.5.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
296
+ "model.layers.6.input_layernorm.weight": "model-00001-of-00002.safetensors",
297
+ "model.layers.6.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
298
+ "model.layers.6.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
299
+ "model.layers.6.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
300
+ "model.layers.6.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
301
+ "model.layers.6.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
302
+ "model.layers.6.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
303
+ "model.layers.6.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
304
+ "model.layers.6.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
305
+ "model.layers.6.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
306
+ "model.layers.6.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
307
+ "model.layers.6.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
308
+ "model.layers.7.input_layernorm.weight": "model-00001-of-00002.safetensors",
309
+ "model.layers.7.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
310
+ "model.layers.7.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
311
+ "model.layers.7.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
312
+ "model.layers.7.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
313
+ "model.layers.7.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
314
+ "model.layers.7.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
315
+ "model.layers.7.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
316
+ "model.layers.7.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
317
+ "model.layers.7.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
318
+ "model.layers.7.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
319
+ "model.layers.7.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
320
+ "model.layers.8.input_layernorm.weight": "model-00001-of-00002.safetensors",
321
+ "model.layers.8.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
322
+ "model.layers.8.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
323
+ "model.layers.8.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
324
+ "model.layers.8.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
325
+ "model.layers.8.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
326
+ "model.layers.8.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
327
+ "model.layers.8.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
328
+ "model.layers.8.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
329
+ "model.layers.8.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
330
+ "model.layers.8.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
331
+ "model.layers.8.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
332
+ "model.layers.9.input_layernorm.weight": "model-00001-of-00002.safetensors",
333
+ "model.layers.9.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
334
+ "model.layers.9.mlp.gate_proj.weight": "model-00001-of-00002.safetensors",
335
+ "model.layers.9.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
336
+ "model.layers.9.post_attention_layernorm.weight": "model-00001-of-00002.safetensors",
337
+ "model.layers.9.self_attn.k_proj.bias": "model-00001-of-00002.safetensors",
338
+ "model.layers.9.self_attn.k_proj.weight": "model-00001-of-00002.safetensors",
339
+ "model.layers.9.self_attn.o_proj.weight": "model-00001-of-00002.safetensors",
340
+ "model.layers.9.self_attn.q_proj.bias": "model-00001-of-00002.safetensors",
341
+ "model.layers.9.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
342
+ "model.layers.9.self_attn.v_proj.bias": "model-00001-of-00002.safetensors",
343
+ "model.layers.9.self_attn.v_proj.weight": "model-00001-of-00002.safetensors",
344
+ "model.norm.weight": "model-00002-of-00002.safetensors"
345
+ }
346
+ }
mucodec.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:339a16956b859a82defc02bfd32c3744d11ff942065f6ec9306dfd4400d62110
3
+ size 4704507596
muencoder.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e250df56b035f74c1f66f15133f4c78f664d70fa0b09aa9a752b7871bb58c02f
3
+ size 3957949089
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "add_prefix_space": false,
3
+ "added_tokens_decoder": {
4
+ "151643": {
5
+ "content": "<|endoftext|>",
6
+ "lstrip": false,
7
+ "normalized": false,
8
+ "rstrip": false,
9
+ "single_word": false,
10
+ "special": true
11
+ },
12
+ "151644": {
13
+ "content": "<|im_start|>",
14
+ "lstrip": false,
15
+ "normalized": false,
16
+ "rstrip": false,
17
+ "single_word": false,
18
+ "special": true
19
+ },
20
+ "151645": {
21
+ "content": "<|im_end|>",
22
+ "lstrip": false,
23
+ "normalized": false,
24
+ "rstrip": false,
25
+ "single_word": false,
26
+ "special": true
27
+ }
28
+ },
29
+ "additional_special_tokens": ["<|im_start|>", "<|im_end|>"],
30
+ "bos_token": null,
31
+ "chat_template": "{% for message in messages %}{% if loop.first and messages[0]['role'] != 'system' %}{{ '<|im_start|>system\nYou are a helpful assistant<|im_end|>\n' }}{% endif %}{{'<|im_start|>' + message['role'] + '\n' + message['content'] + '<|im_end|>' + '\n'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant\n' }}{% endif %}",
32
+ "clean_up_tokenization_spaces": false,
33
+ "eos_token": "<|endoftext|>",
34
+ "errors": "replace",
35
+ "model_max_length": 32768,
36
+ "pad_token": "<|endoftext|>",
37
+ "split_special_tokens": false,
38
+ "tokenizer_class": "Qwen2Tokenizer",
39
+ "unk_token": null
40
+ }
vocab.json ADDED
The diff for this file is too large to render. See raw diff