--- license: cc-by-nc-sa-4.0 pipeline_tag: audio-classification tags: - autrainer - audio - orthoptera-tagging - HearTheSpecies --- # InsectNet for the Biodiversity Exploratories Model that tags audio files as belonging to one or more of 29 (t.b.d. below) prevalent Orthoptera species within the Biodiversity Exploratories. We also have a Silence, Buzz, and Bird tag, but these predictions should be ignored and are only incorporated for the training. # Installation To use the model, you have to install autrainer, e.g. via pip: ``` pip install autrainer ``` This model has been trained and tested with autrainer version `0.6.0`. For more information about autrainer, please refer to: https://autrainer.github.io/autrainer/index.html # Usage The model can be applied on all wav files present in a folder (``) and stored in another folder (``): ``` autrainer inference hf:AlexanderGbd/InsectNetLocal -r -w 4 -s 4 -sr 96000 ``` , where `-w` is the window size in seconds, `-s` is the step size in seconds and `-sr` is the sampling rate. For other possible inference settings and all usable parameters, please have a look at the autrainer documentation. However, the above settings are recommended. ## Training ### Pretraining TODO ### Dataset TODO ### Features The audio recordings were resampled to 96kHz, as we wanted to avoid losing too much frequency information from the species. Log-Mel spectrograms were then extracted using torchlibrosa. ### Training process The model has been trained for 30 epochs. At the end of each epoch, the model was evaluated on our validation set. We release the state that achieved the best performance on this validation set. All training hyperparameters can be found inside `conf/config.yaml` inside the model folder. ## Evaluation The performance on the test set reached a (macro) f1-score of 0.70. ## Acknowledgments TODO Please acknowledge the work which produced the original model. We would appreciate an acknowledgment to autrainer.