Please use this identifier to cite or link to this item: https://elibrary.khec.edu.np/handle/123456789/1013
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorEr. Milan Chikanbanjar-
dc.contributor.authorPRAFUL MAN THAKU (770326) ROCKY SHRESTHA (770331) SAURAV BASUKALA (770339) SOHAN BASNET (770344)-
dc.date.accessioned2025-09-15T12:24:30Z-
dc.date.available2025-09-15T12:24:30Z-
dc.date.issued2025-
dc.identifier.urihttps://elibrary.khec.edu.np/handle/123456789/1013-
dc.description.abstractThis report suggests a powerful Environmental Sound Classification (ESC) system using Convolutional Recurrent Neural Networks (CRNN) that use convolutional layers to learn spatial features from the spectrogram representation of sounds and recurrent layers for modeling temporal patterns in sounds. UrbanSound8K dataset, which contains 8,732 sound samples in 10 urban sound classes, was augmented using data augmentation techniques such as shifting, time stretching, and noise addition to 15,000 samples. Mel-Frequency Cepstral Coefficients (MFCCs) were extracted to analyze sound data. Four deep learning architectures; Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Bidirectional LSTM (BiLSTM) were trained and evaluated. Of these, BiLSTM performed the best with 99.84% training accuracy, 93.75% validation accuracy, and a macro F1-score of 0.92. CNN followed with 99.89% and 92.83% training and validation accuracy, respectively. GRU (97.00%/91.00%) exhibited quick convergence and minimal overfitting, hence being perfect for light-weighted applications, while LSTM had 97.27%/90.58%. The results reveal the phenomenal ability of CRNN-based approaches, especially BiLSTM, in offering trustworthy, precise, and versatile sound perception in intelligent, sound-responsive systems.-
dc.format.extent66 p-
dc.subjectEnvironmental Sound Classification, Deep Learning, CRNN, LSTM, GRU, BiLSTM, CNN, MFCC, Augmentation, UrbanSound8K.-
dc.title"ENVIRONMENTAL SOUND CLASSIFICATION" (Using Deep Learning Models- LSTM, BiLSTM, GRU, CNN)-
dc.typeReport-
local.college.nameKhwopa Engineering College-
local.degree.departmentDepartment of Computer Engineering-
local.college.batch2077-
local.degree.nameBE Computer-
local.degree.levelBachelor's Degree-
local.item.accessionnumberD.1551-
Appears in Collections:PU Computer Report

Files in This Item:
File SizeFormat 
Environmental Sound Classification.pdf
  Restricted Access
1.5 MBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.