A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early‐onset monogenic disorders in Indians. Issue 4 (1st March 2021)
- Record Type:
- Journal Article
- Title:
- A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early‐onset monogenic disorders in Indians. Issue 4 (1st March 2021)
- Main Title:
- A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early‐onset monogenic disorders in Indians
- Authors:
- Kausthubham, Neethukrishna
Shukla, Anju
Gupta, Neerja
Bhavani, Gandham S.
Kulshrestha, Samarth
Das Bhowmik, Aneek
Moirangthem, Amita
Bijarnia‐Mahay, Sunita
Kabra, Madhulika
Puri, Ratna D.
Mandal, Kausik
Verma, Ishwar C.
Bielas, Stephanie L.
Phadke, Shubha R.
Dalal, Ashwin
Girisha, Katta M. - Abstract:
- Abstract: Given the genomic uniqueness, a local data set is most desired for Indians, who are underrepresented in existing public databases. We hypothesize patients with rare monogenic disorders and their family members can provide a reliable source of common variants in the population. Exome sequencing (ES) data from families with rare Mendelian disorders was aggregated from five centers in India. The dataset was refined by excluding related individuals and removing the disease‐causing variants (refined cohort). The efficiency of these data sets was assessed in a new set of 50 exomes against gnomAD and GenomeAsia. Our original cohort comprised 1455 individuals from 1203 families. The refined cohort had 836 unrelated individuals that retained 1, 251, 064 variants with 181, 125 population‐specific and 489, 618 common variants. The allele frequencies from our cohort helped to define 97, 609 rare variants in gnomAD and 44, 520 rare variants in GenomeAsia as common variants in our population. Our variant dataset provided an additional 1.7% and 0.1% efficiency for prioritizing heterozygous and homozygous variants respectively for rare monogenic disorders. We observed additional 19 genes/human knockouts. We list carrier frequency for 142 recessive disorders. This is a large and useful resource of exonic variants for Indians. Despite limitations, datasets from patients are efficient tools for variant prioritization in a resource‐limited setting. Abstract : This study highlights theAbstract: Given the genomic uniqueness, a local data set is most desired for Indians, who are underrepresented in existing public databases. We hypothesize patients with rare monogenic disorders and their family members can provide a reliable source of common variants in the population. Exome sequencing (ES) data from families with rare Mendelian disorders was aggregated from five centers in India. The dataset was refined by excluding related individuals and removing the disease‐causing variants (refined cohort). The efficiency of these data sets was assessed in a new set of 50 exomes against gnomAD and GenomeAsia. Our original cohort comprised 1455 individuals from 1203 families. The refined cohort had 836 unrelated individuals that retained 1, 251, 064 variants with 181, 125 population‐specific and 489, 618 common variants. The allele frequencies from our cohort helped to define 97, 609 rare variants in gnomAD and 44, 520 rare variants in GenomeAsia as common variants in our population. Our variant dataset provided an additional 1.7% and 0.1% efficiency for prioritizing heterozygous and homozygous variants respectively for rare monogenic disorders. We observed additional 19 genes/human knockouts. We list carrier frequency for 142 recessive disorders. This is a large and useful resource of exonic variants for Indians. Despite limitations, datasets from patients are efficient tools for variant prioritization in a resource‐limited setting. Abstract : This study highlights the utility of pooling exome sequencing data from families suspected with rare disorders in a population that is underrepresented in public variant repositories. Despite inherent limitations of the dataset, the challenge of a resource limited setting combined with the absence of an 'ideal population specific cohort', a variant dataset derived from clinical and research exomes are likely to be efficient for variant prioritization for early‐onset monogenic disorders in India or people of Indian origin. To enable a clearer prioritization of disease‐causing variants, the aggregated cohort was refined by discounting the disease causing variants and removing any related individuals and thus we catalogued a total of 1.25 million variants, of which 181, 000 variants are population specific. … (more)
- Is Part Of:
- Human mutation. Volume 42:Issue 4(2021)
- Journal:
- Human mutation
- Issue:
- Volume 42:Issue 4(2021)
- Issue Display:
- Volume 42, Issue 4 (2021)
- Year:
- 2021
- Volume:
- 42
- Issue:
- 4
- Issue Sort Value:
- 2021-0042-0004-0000
- Page Start:
- e15
- Page End:
- e61
- Publication Date:
- 2021-03-01
- Subjects:
- exomes -- Indian population -- monogenic disorders -- variant dataset
Human chromosome abnormalities -- Periodicals
Mutation (Biology) -- Periodicals
616.04205 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-1004 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/humu.24172 ↗
- Languages:
- English
- ISSNs:
- 1059-7794
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4336.217000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23395.xml