TY - JOUR
T1 - Large-scale annotated dataset for cochlear hair cell detection and classification
AU - Buswinka, Christopher J.
AU - Rosenberg, David B.
AU - Simikyan, Rubina G.
AU - Osgood, Richard T.
AU - Fernandez, Katharine
AU - Nitta, Hidetomi
AU - Hayashi, Yushi
AU - Liberman, Leslie W.
AU - Nguyen, Emily
AU - Yildiz, Erdem
AU - Kim, Jinkyung
AU - Jarysta, Amandine
AU - Renauld, Justine
AU - Wesson, Ella
AU - Wang, Haobing
AU - Thapa, Punam
AU - Bordiga, Pierrick
AU - McMurtry, Noah
AU - Llamas, Juan
AU - Kitcher, Siân R.
AU - López-Porras, Ana I.
AU - Cui, Runjia
AU - Behnammanesh, Ghazaleh
AU - Bird, Jonathan E.
AU - Ballesteros, Angela
AU - Vélez-Ortega, A. Catalina
AU - Edge, Albert S.B.
AU - Deans, Michael R.
AU - Gnedeva, Ksenia
AU - Shrestha, Brikha R.
AU - Manor, Uri
AU - Zhao, Bo
AU - Ricci, Anthony J.
AU - Tarchini, Basile
AU - Basch, Martín L.
AU - Stepanyan, Ruben
AU - Landegger, Lukas D.
AU - Rutherford, Mark
AU - Liberman, M. Charles
AU - Walters, Bradley J.
AU - Kros, Corné J.
AU - Richardson, Guy P.
AU - Cunningham, Lisa L.
AU - Indzhykulian, Artur A.
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/12
Y1 - 2024/12
N2 - Our sense of hearing is mediated by cochlear hair cells, of which there are two types organized in one row of inner hair cells and three rows of outer hair cells. Each cochlea contains 5–15 thousand terminally differentiated hair cells, and their survival is essential for hearing as they do not regenerate after insult. It is often desirable in hearing research to quantify the number of hair cells within cochlear samples, in both pathological conditions, and in response to treatment. Machine learning can be used to automate the quantification process but requires a vast and diverse dataset for effective training. In this study, we present a large collection of annotated cochlear hair-cell datasets, labeled with commonly used hair-cell markers and imaged using various fluorescence microscopy techniques. The collection includes samples from mouse, rat, guinea pig, pig, primate, and human cochlear tissue, from normal conditions and following in-vivo and in-vitro ototoxic drug application. The dataset includes over 107,000 hair cells which have been identified and annotated as either inner or outer hair cells. This dataset is the result of a collaborative effort from multiple laboratories and has been carefully curated to represent a variety of imaging techniques. With suggested usage parameters and a well-described annotation procedure, this collection can facilitate the development of generalizable cochlear hair-cell detection models or serve as a starting point for fine-tuning models for other analysis tasks. By providing this dataset, we aim to give other hearing research groups the opportunity to develop their own tools with which to analyze cochlear imaging data more fully, accurately, and with greater ease.
AB - Our sense of hearing is mediated by cochlear hair cells, of which there are two types organized in one row of inner hair cells and three rows of outer hair cells. Each cochlea contains 5–15 thousand terminally differentiated hair cells, and their survival is essential for hearing as they do not regenerate after insult. It is often desirable in hearing research to quantify the number of hair cells within cochlear samples, in both pathological conditions, and in response to treatment. Machine learning can be used to automate the quantification process but requires a vast and diverse dataset for effective training. In this study, we present a large collection of annotated cochlear hair-cell datasets, labeled with commonly used hair-cell markers and imaged using various fluorescence microscopy techniques. The collection includes samples from mouse, rat, guinea pig, pig, primate, and human cochlear tissue, from normal conditions and following in-vivo and in-vitro ototoxic drug application. The dataset includes over 107,000 hair cells which have been identified and annotated as either inner or outer hair cells. This dataset is the result of a collaborative effort from multiple laboratories and has been carefully curated to represent a variety of imaging techniques. With suggested usage parameters and a well-described annotation procedure, this collection can facilitate the development of generalizable cochlear hair-cell detection models or serve as a starting point for fine-tuning models for other analysis tasks. By providing this dataset, we aim to give other hearing research groups the opportunity to develop their own tools with which to analyze cochlear imaging data more fully, accurately, and with greater ease.
UR - http://www.scopus.com/inward/record.url?scp=85191196311&partnerID=8YFLogxK
U2 - 10.1038/s41597-024-03218-y
DO - 10.1038/s41597-024-03218-y
M3 - Article
C2 - 38653806
AN - SCOPUS:85191196311
SN - 2052-4463
VL - 11
JO - Scientific data
JF - Scientific data
IS - 1
M1 - 416
ER -