{"id":13454313,"identifier":"DVN/YVWOAL","persistentUrl":"https://doi.org/10.7910/DVN/YVWOAL","protocol":"doi","authority":"10.7910","separator":"/","publisher":"Harvard Dataverse","publicationDate":"2026-02-11","storageIdentifier":"s3://10.7910/DVN/YVWOAL","datasetType":"dataset","datasetVersion":{"id":612190,"datasetId":13454313,"datasetPersistentId":"doi:10.7910/DVN/YVWOAL","datasetType":"dataset","storageIdentifier":"s3://10.7910/DVN/YVWOAL","versionNumber":1,"internalVersionNumber":6,"versionMinorNumber":0,"versionState":"RELEASED","latestVersionPublishingState":"RELEASED","lastUpdateTime":"2026-02-11T17:10:11Z","releaseTime":"2026-02-11T17:10:11Z","createTime":"2026-02-11T13:39:46Z","publicationDate":"2026-02-11","citationDate":"2026-02-11","license":{"name":"CC0 1.0","uri":"http://creativecommons.org/publicdomain/zero/1.0","iconUri":"https://licensebuttons.net/p/zero/1.0/88x31.png","rightsIdentifier":"CC0-1.0","rightsIdentifierScheme":"SPDX","schemeUri":"https://spdx.org/licenses/","languageCode":"en"},"fileAccessRequest":true,"metadataBlocks":{"citation":{"displayName":"Citation Metadata","name":"citation","fields":[{"typeName":"title","multiple":false,"typeClass":"primitive","value":"BAS4R: A Multi-Condition Bangla Speech Dataset for Gender-Aware Real and Fake Voice Analysis"},{"typeName":"author","multiple":true,"typeClass":"compound","value":[{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Al Arian Ahmad"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"https://ror.org/01vxg3438"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"https://orcid.org/0009-0009-6344-0189","expandedvalue":{"@id":"https://orcid.org/0009-0009-6344-0189","scheme":"ORCID","@type":"https://schema.org/Person"}}},{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Turzo, Nakib"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"https://ror.org/01vxg3438"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"https://orcid.org/0000-0002-4275-2636","expandedvalue":{"personName":"Turzo, Nakib","@id":"https://orcid.org/0000-0002-4275-2636","scheme":"ORCID","@type":"https://schema.org/Person"}}},{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Durjoy Kumar Dutta"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"https://ror.org/01vxg3438"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"https://orcid.org/0009-0000-1800-7508","expandedvalue":{"@id":"https://orcid.org/0009-0000-1800-7508","scheme":"ORCID","@type":"https://schema.org/Person"}}},{"authorName":{"typeName":"authorName","multiple":false,"typeClass":"primitive","value":"Wadud, Abdul Wadud"},"authorAffiliation":{"typeName":"authorAffiliation","multiple":false,"typeClass":"primitive","value":"https://ror.org/01vxg3438"},"authorIdentifierScheme":{"typeName":"authorIdentifierScheme","multiple":false,"typeClass":"controlledVocabulary","value":"ORCID"},"authorIdentifier":{"typeName":"authorIdentifier","multiple":false,"typeClass":"primitive","value":"https://orcid.org/0009-0000-5181-6964","expandedvalue":{"personName":"Wadud, Abdul Wadud","@id":"https://orcid.org/0009-0000-5181-6964","scheme":"ORCID","@type":"https://schema.org/Person"}}}]},{"typeName":"datasetContact","multiple":true,"typeClass":"compound","value":[{"datasetContactName":{"typeName":"datasetContactName","multiple":false,"typeClass":"primitive","value":"Al Arian Ahmad"},"datasetContactAffiliation":{"typeName":"datasetContactAffiliation","multiple":false,"typeClass":"primitive","value":"Pabna University of Science and Technology"},"datasetContactEmail":{"typeName":"datasetContactEmail","multiple":false,"typeClass":"primitive","value":"arian.cse.pust@gmail.com"}}]},{"typeName":"dsDescription","multiple":true,"typeClass":"compound","value":[{"dsDescriptionValue":{"typeName":"dsDescriptionValue","multiple":false,"typeClass":"primitive","value":"BAS4R is a structured and large-scale Bangla speech dataset developed to support research in replay attack detection and audio spoofing analysis within voice biometric systems. The dataset contains both authentic (real) and systematically manipulated (spoofed) speech recordings collected under controlled and realistic acoustic conditions.\n\nThe complete dataset comprises 143.88 hours of audio recordings, totaling 120,125 audio files, organized into five major categories:\n\nChannel-based: 28,830 files (34.65 hours)\n\nSignal Processing-based: 28,830 files (34.53 hours)\n\nEffect-based: 28,830 files (34.48 hours)\n\nReplay-based: 28,830 files (34.48 hours)\n\nReal Data: 4,805 files (5.75 hours)\n\nSpeech samples were collected from 100 native Bangla speakers (50 male and 50 female) aged 20–26 years, ensuring balanced gender representation and demographic consistency. All recordings were captured in controlled environments and stored in high-quality digital audio format.\n\nThe dataset follows a structured hierarchical organization separating real and spoofed samples by category and attack condition, facilitating reproducible research. The spoofed data were generated using real signal processing techniques, channel transmission effects, environmental distortions, and replay setups.\n\nBAS4R is suitable for research in anti-spoofing systems, speaker verification robustness evaluation, replay attack detection, and deep learning–based audio classification."},"dsDescriptionDate":{"typeName":"dsDescriptionDate","multiple":false,"typeClass":"primitive","value":"2026-02-11"}}]},{"typeName":"subject","multiple":true,"typeClass":"controlledVocabulary","value":["Computer and Information Science","Engineering"]},{"typeName":"depositor","multiple":false,"typeClass":"primitive","value":"Al Arian Ahmad"},{"typeName":"dateOfDeposit","multiple":false,"typeClass":"primitive","value":"2026-02-11"}]}},"files":[{"label":"Channel-based.zip","restricted":false,"version":1,"datasetVersionId":612190,"dataFile":{"id":13454317,"persistentId":"","filename":"Channel-based.zip","contentType":"application/zip","friendlyType":"ZIP Archive","filesize":2307531565,"storageIdentifier":"s3://dvn-cloud:19c4c6d9bbb-0801086541cd","rootDataFileId":-1,"md5":"785c8fed3f509cbe7e336099c3f78a2b","checksum":{"type":"MD5","value":"785c8fed3f509cbe7e336099c3f78a2b"},"tabularData":false,"creationDate":"2026-02-11","publicationDate":"2026-02-11","lastUpdateTime":"2026-02-11T17:10:11Z","fileAccessRequest":true}},{"label":"Effect-based.zip","restricted":false,"version":1,"datasetVersionId":612190,"dataFile":{"id":13454315,"persistentId":"","filename":"Effect-based.zip","contentType":"application/zip","friendlyType":"ZIP Archive","filesize":2650879946,"storageIdentifier":"s3://dvn-cloud:19c4c82be22-3d33c9d8e09c","rootDataFileId":-1,"md5":"bd84c6354fba77dd85c9424636cd39de","checksum":{"type":"MD5","value":"bd84c6354fba77dd85c9424636cd39de"},"tabularData":false,"creationDate":"2026-02-11","publicationDate":"2026-02-11","lastUpdateTime":"2026-02-11T17:10:11Z","fileAccessRequest":true}},{"label":"Real Data.zip","restricted":false,"version":1,"datasetVersionId":612190,"dataFile":{"id":13454318,"persistentId":"","filename":"Real Data.zip","contentType":"application/zip","friendlyType":"ZIP Archive","filesize":2312542676,"storageIdentifier":"s3://dvn-cloud:19c4c560e4d-42b9a1e5dafc","rootDataFileId":-1,"md5":"88bb090827317d2d4595ad4196aa41d1","checksum":{"type":"MD5","value":"88bb090827317d2d4595ad4196aa41d1"},"tabularData":false,"creationDate":"2026-02-11","publicationDate":"2026-02-11","lastUpdateTime":"2026-02-11T17:10:11Z","fileAccessRequest":true}},{"label":"Replay-based.zip","restricted":false,"version":1,"datasetVersionId":612190,"dataFile":{"id":13454316,"persistentId":"","filename":"Replay-based.zip","contentType":"application/zip","friendlyType":"ZIP Archive","filesize":2656898557,"storageIdentifier":"s3://dvn-cloud:19c4c956bef-849d9679e49a","rootDataFileId":-1,"md5":"6194c0953efa43778671b6ee6a199e3c","checksum":{"type":"MD5","value":"6194c0953efa43778671b6ee6a199e3c"},"tabularData":false,"creationDate":"2026-02-11","publicationDate":"2026-02-11","lastUpdateTime":"2026-02-11T17:10:11Z","fileAccessRequest":true}},{"label":"Signal Processing.zip","restricted":false,"version":1,"datasetVersionId":612190,"dataFile":{"id":13454314,"persistentId":"","filename":"Signal Processing.zip","contentType":"application/zip","friendlyType":"ZIP Archive","filesize":2584858383,"storageIdentifier":"s3://dvn-cloud:19c4ca7fb30-6098aa7cbb7f","rootDataFileId":-1,"md5":"9a095e9f5385629629e99993895b9a71","checksum":{"type":"MD5","value":"9a095e9f5385629629e99993895b9a71"},"tabularData":false,"creationDate":"2026-02-11","publicationDate":"2026-02-11","lastUpdateTime":"2026-02-11T17:10:11Z","fileAccessRequest":true}}],"citation":"Al Arian Ahmad; Turzo, Nakib; Durjoy Kumar Dutta; Wadud, Abdul Wadud, 2026, \"BAS4R: A Multi-Condition Bangla Speech Dataset for Gender-Aware Real and Fake Voice Analysis\", https://doi.org/10.7910/DVN/YVWOAL, Harvard Dataverse, V1"}}