"ENVIRONMENTAL SOUND CLASSIFICATION" (Using Deep Learning Models- LSTM, BiLSTM, GRU, CNN)

PRAFUL MAN THAKU (770326) ROCKY SHRESTHA (770331) SAURAV BASUKALA (770339) SOHAN BASNET (770344)

Please use this identifier to cite or link to this item: https://elibrary.khec.edu.np/handle/123456789/1013

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Er. Milan Chikanbanjar	-
dc.contributor.author	PRAFUL MAN THAKU (770326) ROCKY SHRESTHA (770331) SAURAV BASUKALA (770339) SOHAN BASNET (770344)	-
dc.date.accessioned	2025-09-15T12:24:30Z	-
dc.date.available	2025-09-15T12:24:30Z	-
dc.date.issued	2025	-
dc.identifier.uri	https://elibrary.khec.edu.np/handle/123456789/1013	-
dc.description.abstract	This report suggests a powerful Environmental Sound Classification (ESC) system using Convolutional Recurrent Neural Networks (CRNN) that use convolutional layers to learn spatial features from the spectrogram representation of sounds and recurrent layers for modeling temporal patterns in sounds. UrbanSound8K dataset, which contains 8,732 sound samples in 10 urban sound classes, was augmented using data augmentation techniques such as shifting, time stretching, and noise addition to 15,000 samples. Mel-Frequency Cepstral Coefficients (MFCCs) were extracted to analyze sound data. Four deep learning architectures; Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Bidirectional LSTM (BiLSTM) were trained and evaluated. Of these, BiLSTM performed the best with 99.84% training accuracy, 93.75% validation accuracy, and a macro F1-score of 0.92. CNN followed with 99.89% and 92.83% training and validation accuracy, respectively. GRU (97.00%/91.00%) exhibited quick convergence and minimal overfitting, hence being perfect for light-weighted applications, while LSTM had 97.27%/90.58%. The results reveal the phenomenal ability of CRNN-based approaches, especially BiLSTM, in offering trustworthy, precise, and versatile sound perception in intelligent, sound-responsive systems.	-
dc.format.extent	66 p	-
dc.subject	Environmental Sound Classification, Deep Learning, CRNN, LSTM, GRU, BiLSTM, CNN, MFCC, Augmentation, UrbanSound8K.	-
dc.title	"ENVIRONMENTAL SOUND CLASSIFICATION" (Using Deep Learning Models- LSTM, BiLSTM, GRU, CNN)	-
dc.type	Report	-
local.college.name	Khwopa Engineering College	-
local.degree.department	Department of Computer Engineering	-
local.college.batch	2077	-
local.degree.name	BE Computer	-
local.degree.level	Bachelor's Degree	-
local.item.accessionnumber	D.1551	-
Appears in Collections:	PU Computer Report

Files in This Item:

File	Size	Format
Environmental Sound Classification.pdf Restricted Access	1.5 MB	Adobe PDF	View/Open Request a copy

Show simple item record