Kaikki aineistot
Lisää
Abstract Background: Both statins and proprotein convertase subtilisin/kexin type 9 (PCSK9) inhibitors lower blood low-density lipoprotein cholesterol levels to reduce risk of cardiovascular events. To assess potential differences between metabolic effects of these 2 lipid-lowering therapies, we performed detailed lipid and metabolite profiling of a large randomized statin trial and compared the results with the effects of genetic inhibition of PCSK9, acting as a naturally occurring trial. Methods: Two hundred twenty-eight circulating metabolic measures were quantified by nuclear magnetic resonance spectroscopy, including lipoprotein subclass concentrations and their lipid composition, fatty acids, and amino acids, for 5359 individuals (2659 on treatment) in the PROSPER (Prospective Study of Pravastatin in the Elderly at Risk) trial at 6 months postrandomization. The corresponding metabolic measures were analyzed in 8 population cohorts (N=72 185) using PCSK9 rs11591147 as an unconfounded proxy to mimic the therapeutic effects of PCSK9 inhibitors. Results: Scaled to an equivalent lowering of low-density lipoprotein cholesterol, the effects of genetic inhibition of PCSK9 on 228 metabolic markers were generally consistent with those of statin therapy (R2=0.88). Alterations in lipoprotein lipid composition and fatty acid distribution were similar. However, discrepancies were observed for very-low-density lipoprotein lipid measures. For instance, genetic inhibition of PCSK9 had weaker effects on lowering of very-low-density lipoprotein cholesterol compared with statin therapy (54% versus 77% reduction, relative to the lowering effect on low-density lipoprotein cholesterol; P=2×10−7 for heterogeneity). Genetic inhibition of PCSK9 showed no significant effects on amino acids, ketones, or a marker of inflammation (GlycA), whereas statin treatment weakly lowered GlycA levels. Conclusions: Genetic inhibition of PCSK9 had similar metabolic effects to statin therapy on detailed lipid and metabolite profiles. However, PCSK9 inhibitors are predicted to have weaker effects on very-low-density lipoprotein lipids compared with statins for an equivalent lowering of low-density lipoprotein cholesterol, which potentially translate into smaller reductions in cardiovascular disease risk.
Abstract In this study we aim to examine gene–environment interactions (GxEs) between genes involved with estrogen metabolism and environmental factors related to estrogen exposure. GxE analyses were conducted with 1970 Korean breast cancer cases and 2052 controls in the case-control study, the Seoul Breast Cancer Study (SEBCS). A total of 11,555 SNPs from the 137 candidate genes were included in the GxE analyses with eight established environmental factors. A replication test was conducted by using an independent population from the Breast Cancer Association Consortium (BCAC), with 62,485 Europeans and 9047 Asians. The GxE tests were performed by using two-step methods in GxEScan software. Two interactions were found in the SEBCS. The first interaction was shown between rs13035764 of NCOA1 and age at menarche in the GE|2df model (p-2df = 1.2 × 10−3). The age at menarche before 14 years old was associated with the high risk of breast cancer, and the risk was higher when subjects had homozygous minor allele G. The second GxE was shown between rs851998 near ESR1 and height in the GE|2df model (p-2df = 1.1 × 10−4). Height taller than 160 cm was associated with a high risk of breast cancer, and the risk increased when the minor allele was added. The findings were not replicated in the BCAC. These results would suggest specificity in Koreans for breast cancer risk.
Abstract Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 x 10-8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.
Abstract A combination of genetic and functional approaches has identified three independent breast cancer risk loci at 2q35. A recent fine-scale mapping analysis to refine these associations resulted in 1 (signal 1), 5 (signal 2), and 42 (signal 3) credible causal variants at these loci. We used publicly available in silico DNase I and ChIP-seq data with in vitro reporter gene and CRISPR assays to annotate signals 2 and 3. We identified putative regulatory elements that enhanced cell-type-specific transcription from the IGFBP5 promoter at both signals (30- to 40-fold increased expression by the putative regulatory element at signal 2, 2- to 3-fold by the putative regulatory element at signal 3). We further identified one of the five credible causal variants at signal 2, a 1.4 kb deletion (esv3594306), as the likely causal variant; the deletion allele of this variant was associated with an average additional increase in IGFBP5 expression of 1.3-fold (MCF-7) and 2.2-fold (T-47D). We propose a model in which the deletion allele of esv3594306 juxtaposes two transcription factor binding regions (annotated by estrogen receptor alpha ChIP-seq peaks) to generate a single extended regulatory element. This regulatory element increases cell-type-specific expression of the tumor suppressor gene IGFBP5 and, thereby, reduces risk of estrogen receptor-positive breast cancer (odds ratio = 0.77, 95% CI 0.74–0.81, p = 3.1 × 10−31).
Abstract Autosomal genetic analyses of blood lipids have yielded key insights for coronary heart disease (CHD). However, X chromosome genetic variation is understudied for blood lipids in large sample sizes. We now analyze genetic and blood lipid data in a high-coverage whole X chromosome sequencing study of 65,322 multi-ancestry participants and perform replication among 456,893 European participants. Common alleles on chromosome Xq23 are strongly associated with reduced total cholesterol, LDL cholesterol, and triglycerides (min P = 8.5 × 10−72), with similar effects for males and females. Chromosome Xq23 lipid-lowering alleles are associated with reduced odds for CHD among 42,545 cases and 591,247 controls (P = 1.7 × 10−4), and reduced odds for diabetes mellitus type 2 among 54,095 cases and 573,885 controls (P = 1.4 × 10−5). Although we observe an association with increased BMI, waist-to-hip ratio adjusted for BMI is reduced, bioimpedance analyses indicate increased gluteofemoral fat, and abdominal MRI analyses indicate reduced visceral adiposity. Co-localization analyses strongly correlate increased CHRDL1 gene expression, particularly in adipose tissue, with reduced concentrations of blood lipids.
Abstract Increased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use1. Despite advances in prevention and treatment, in particular through reducing low-density lipoprotein cholesterol levels2, heart disease remains the leading cause of death worldwide3. Genome-wideassociation studies (GWAS) of blood lipid levels have led to important biological and clinical insights, as well as new drug targets, for cardiovascular disease. However, most previous GWAS4‐23 have been conducted in European ancestry populations and may have missed genetic variants that contribute to lipid-level variation in other ancestry groups. These include differences in allele frequencies, effect sizes and linkage-disequilibrium patterns24. Here we conduct a multi-ancestry, genome-wide genetic discovery meta-analysis of lipid levels in approximately 1.65 million individuals, including 350,000 of non-European ancestries. We quantify the gain in studying non-European ancestries and provide evidence to support the expansion of recruitment of additional ancestries, even with relatively small sample sizes. We find that increasing diversity rather than studying additional individuals of European ancestry results in substantial improvements in fine-mapping functional variants and portability of polygenic prediction (evaluated in approximately 295,000 individuals from 7 ancestry groupings). Modest gains in the number of discovered loci and ancestry-specific variants were also achieved. As GWAS expand emphasis beyond the identification of genes and fundamental biology towards the use of genetic variants for preventive and precision medicine25, we anticipate that increased diversity of participants will lead to more accurate and equitable26 application of polygenic scores in clinical practice.
Abstract In many species, the offspring of related parents suffer reduced reproductive success, a phenomenon known as inbreeding depression. In humans, the importance of this effect has remained unclear, partly because reproduction between close relatives is both rare and frequently associated with confounding social factors. Here, using genomic inbreeding coefficients (FROH) for >1.4 million individuals, we show that FROH is significantly associated (p < 0.0005) with apparently deleterious changes in 32 out of 100 traits analysed. These changes are associated with runs of homozygosity (ROH), but not with common variant homozygosity, suggesting that genetic variants associated with inbreeding depression are predominantly rare. The effect on fertility is striking: FROH equivalent to the offspring of first cousins is associated with a 55% decrease [95% CI 44–66%] in the odds of having children. Finally, the effects of FROH are confirmed within full-sibling pairs, where the variation in FROH is independent of all environmental confounding.
Abstract Background: The rarity of mutations in PALB2, CHEK2 and ATM make it difficult to estimate precisely associated cancer risks. Population-based family studies have provided evidence that at least some of these mutations are associated with breast cancer risk as high as those associated with rare BRCA2 mutations. We aimed to estimate the relative risks associated with specific rare variants in PALB2, CHEK2 and ATM via a multicentre case-control study. Methods: We genotyped 10 rare mutations using the custom iCOGS array: PALB2 c.1592delT, c.2816T>G and c.3113G>A, CHEK2 c.349A>G, c.538C>T, c.715G>A, c.1036C>T, c.1312G>T, and c.1343T>G and ATM c.7271T>G. We assessed associations with breast cancer risk (42 671 cases and 42 164 controls), as well as prostate (22 301 cases and 22 320 controls) and ovarian (14 542 cases and 23 491 controls) cancer risk, for each variant. Results: For European women, strong evidence of association with breast cancer risk was observed for PALB2 c.1592delT OR 3.44 (95% CI 1.39 to 8.52, p = 7.1 × 10−5), PALB2 c.3113G>A OR 4.21 (95% CI 1.84 to 9.60, p = 6.9 × 10−8) and ATM c.7271T>G OR 11.0 (95% CI 1.42 to 85.7, p = 0.0012). We also found evidence of association with breast cancer risk for three variants in CHEK2, c.349A>G OR 2.26 (95% CI 1.29 to 3.95), c.1036C>T OR 5.06 (95% CI 1.09 to 23.5) and c.538C>T OR 1.33 (95% CI 1.05 to 1.67) (p ≤ 0.017). Evidence for prostate cancer risk was observed for CHEK2 c.1343T>G OR 3.03 (95% CI 1.53 to 6.03, p = 0.0006) for African men and CHEK2 c.1312G>T OR 2.21 (95% CI 1.06 to 4.63, p = 0.030) for European men. No evidence of association with ovarian cancer was found for any of these variants. Conclusions: This report adds to accumulating evidence that at least some variants in these genes are associated with an increased risk of breast cancer that is clinically important.
Abstract To dissect the genetic architecture of blood pressure and assess effects on target organ damage, we analyzed 128,272 SNPs from targeted and genome-wide arrays in 201,529 individuals of European ancestry, and genotypes from an additional 140,886 individuals were used for validation. We identified 66 blood pressure–associated loci, of which 17 were new; 15 harbored multiple distinct association signals. The 66 index SNPs were enriched for cis-regulatory elements, particularly in vascular endothelial cells, consistent with a primary role in blood pressure control through modulation of vascular tone across multiple tissues. The 66 index SNPs combined in a risk score showed comparable effects in 64,421 individuals of non-European descent. The 66-SNP blood pressure risk score was significantly associated with target organ damage in multiple tissues but with minor effects in the kidney. Our findings expand current knowledge of blood pressure–related pathways and highlight tissues beyond the classical renal system in blood pressure regulation.
Abstract Background: Although high-density lipoprotein (HDL) and non-HDL cholesterol have opposite associations with coronary heart disease, multi-country reports of lipid trends only use total cholesterol (TC). Our aim was to compare trends in total, HDL and non-HDL cholesterol and the total-to-HDL cholesterol ratio in Asian and Western countries. Methods: We pooled 458 population-based studies with 82.1 million participants in 23 Asian and Western countries. We estimated changes in mean total, HDL and non-HDL cholesterol and mean total-to-HDL cholesterol ratio by country, sex and age group. Results: Since ∼1980, mean TC increased in Asian countries. In Japan and South Korea, the TC rise was due to rising HDL cholesterol, which increased by up to 0.17 mmol/L per decade in Japanese women; in China, it was due to rising non-HDL cholesterol. TC declined in Western countries, except in Polish men. The decline was largest in Finland and Norway, at ∼0.4 mmol/L per decade. The decline in TC in most Western countries was the net effect of an increase in HDL cholesterol and a decline in non-HDL cholesterol, with the HDL cholesterol increase largest in New Zealand and Switzerland. Mean total-to-HDL cholesterol ratio declined in Japan, South Korea and most Western countries, by as much as ∼0.7 per decade in Swiss men (equivalent to ∼26% decline in coronary heart disease risk per decade). The ratio increased in China. Conclusions: HDL cholesterol has risen and the total-to-HDL cholesterol ratio has declined in many Western countries, Japan and South Korea, with only a weak correlation with changes in TC or non-HDL cholesterol.
Abstract High blood cholesterol is typically considered a feature of wealthy western countries1,2. However, dietary and behavioural determinants of blood cholesterol are changing rapidly throughout the world3 and countries are using lipid-lowering medications at varying rates. These changes can have distinct effects on the levels of high-density lipoprotein (HDL) cholesterol and non-HDL cholesterol, which have different effects on human health4,5. However, the trends of HDL and non-HDL cholesterol levels over time have not been previously reported in a global analysis. Here we pooled 1,127 population-based studies that measured blood lipids in 102.6 million individuals aged 18 years and older to estimate trends from 1980 to 2018 in mean total, non-HDL and HDL cholesterol levels for 200 countries. Globally, there was little change in total or non-HDL cholesterol from 1980 to 2018. This was a net effect of increases in low- and middle-income countries, especially in east and southeast Asia, and decreases in high-income western countries, especially those in northwestern Europe, and in central and eastern Europe. As a result, countries with the highest level of non-HDL cholesterol—which is a marker of cardiovascular risk—changed from those in western Europe such as Belgium, Finland, Greenland, Iceland, Norway, Sweden, Switzerland and Malta in 1980 to those in Asia and the Pacific, such as Tokelau, Malaysia, The Philippines and Thailand. In 2017, high non-HDL cholesterol was responsible for an estimated 3.9 million (95% credible interval 3.7 million–4.2 million) worldwide deaths, half of which occurred in east, southeast and south Asia. The global repositioning of lipid-related risk, with non-optimal cholesterol shifting from a distinct feature of high-income countries in northwestern Europe, north America and Australasia to one that affects countries in east and southeast Asia and Oceania should motivate the use of population-based policies and personal interventions to improve nutrition and enhance access to treatment throughout the world.
Abstract Genetic studies of blood pressure (BP) to date have mainly analyzed common variants (minor allele frequency > 0.05). In a meta-analysis of up to similar to 1.3 million participants, we discovered 106 new BP-associated genomic regions and 87 rare (minor allele frequency ≤ 0.01) variant BP associations (P < 5 x 10(−8)), of which 32 were in new BP-associated loci and 55 were independent BP-associated single-nucleotide variants within known BP-associated regions. Average effects of rare variants (44% coding) were similar to 8 times larger than common variant effects and indicate potential candidate causal genes at new and known loci (for example, GATA5 and PLCB3). BP-associated variants (including rare and common) were enriched in regions of active chromatin in fetal tissues, potentially linking fetal development with BP regulation in later life. Multivariable Mendelian randomization suggested possible inverse effects of elevated systolic and diastolic BP on large artery stroke. Our study demonstrates the utility of rare-variant analyses for identifying candidate genes and the results highlight potential therapeutic targets.
Abstract The FLUXNET2015 dataset provides ecosystem-scale data on CO2, water, and energy exchange between the biosphere and the atmosphere, and other meteorological and biological measurements, from 212 sites around the globe (over 1500 site-years, up to and including year 2014). These sites, independently managed and operated, voluntarily contributed their data to create global datasets. Data were quality controlled and processed using uniform methods, to improve consistency and intercomparability across sites. The dataset is already being used in a number of applications, including ecophysiology studies, remote sensing studies, and development of ecosystem and Earth system models. FLUXNET2015 includes derived-data products, such as gap-filled time series, ecosystem respiration and photosynthetic uptake estimates, estimation of uncertainties, and metadata about the measurements, presented for the first time in this paper. In addition, 206 of these sites are for the first time distributed under a Creative Commons (CC-BY 4.0) license. This paper details this enhanced dataset and the processing methods, now made available as open-source codes, making the dataset more accessible, transparent, and reproducible.
Abstract Background: Raised blood pressure is an important risk factor for cardiovascular diseases and chronic kidney disease. We estimated worldwide trends in mean systolic and mean diastolic blood pressure, and the prevalence of, and number of people with, raised blood pressure, defined as systolic blood pressure of 140 mm Hg or higher or diastolic blood pressure of 90 mm Hg or higher. Methods: For this analysis, we pooled national, subnational, or community population-based studies that had measured blood pressure in adults aged 18 years and older. We used a Bayesian hierarchical model to estimate trends from 1975 to 2015 in mean systolic and mean diastolic blood pressure, and the prevalence of raised blood pressure for 200 countries. We calculated the contributions of changes in prevalence versus population growth and ageing to the increase in the number of adults with raised blood pressure. Findings: We pooled 1479 studies that had measured the blood pressures of 19.1 million adults. Global age-standardised mean systolic blood pressure in 2015 was 127.0 mm Hg (95% credible interval 125.7–128.3) in men and 122.3 mm Hg (121.0–123.6) in women; age-standardised mean diastolic blood pressure was 78.7 mm Hg (77.9–79.5) for men and 76.7 mm Hg (75.9–77.6) for women. Global age-standardised prevalence of raised blood pressure was 24.1% (21.4–27.1) in men and 20.1% (17.8–22.5) in women in 2015. Mean systolic and mean diastolic blood pressure decreased substantially from 1975 to 2015 in high-income western and Asia Pacific countries, moving these countries from having some of the highest worldwide blood pressure in 1975 to the lowest in 2015. Mean blood pressure also decreased in women in central and eastern Europe, Latin America and the Caribbean, and, more recently, central Asia, Middle East, and north Africa, but the estimated trends in these super-regions had larger uncertainty than in high-income super-regions. By contrast, mean blood pressure might have increased in east and southeast Asia, south Asia, Oceania, and sub-Saharan Africa. In 2015, central and eastern Europe, sub-Saharan Africa, and south Asia had the highest blood pressure levels. Prevalence of raised blood pressure decreased in high-income and some middle-income countries; it remained unchanged elsewhere. The number of adults with raised blood pressure increased from 594 million in 1975 to 1.13 billion in 2015, with the increase largely in low-income and middle-income countries. The global increase in the number of adults with raised blood pressure is a net effect of increase due to population growth and ageing, and decrease due to declining age-specific prevalence. Interpretation: During the past four decades, the highest worldwide blood pressure levels have shifted from high-income countries to low-income countries in south Asia and sub-Saharan Africa due to opposite trends, while blood pressure has been persistently high in central and eastern Europe.
Abstract Motivation: Aquatic insects comprise 64% of freshwater animal diversity and are widely used as bioindicators to assess water quality impairment and freshwater ecosystem health, as well as to test ecological hypotheses. Despite their importance, a comprehensive, global database of aquatic insect occurrences for mapping freshwater biodiversity in macroecological studies and applied freshwater research is missing. We aim to fill this gap and present the Global EPTO Database, which includes worldwide geo-referenced aquatic insect occurrence records for four major taxa groups: Ephemeroptera, Plecoptera, Trichoptera and Odonata (EPTO). Main type of variables contained: A total of 8,368,467 occurrence records globally, of which 8,319,689 (99%) are publicly available. The records are attributed to the corresponding drainage basin and sub-catchment based on the Hydrography90m dataset and are accompanied by the elevation value, the freshwater ecoregion and the protection status of their location. Spatial location and grain: The database covers the global extent, with 86% of the observation records having coordinates with at least four decimal digits (11.1 m precision at the equator) in the World Geodetic System 1984 (WGS84) coordinate reference system. Time period and grain: Sampling years span from 1951 to 2021. Ninety-nine percent of the records have information on the year of the observation, 95% on the year and month, while 94% have a complete date. In the case of seven sub-datasets, exact dates can be retrieved upon communication with the data contributors. Major taxa and level of measurement: Ephemeroptera, Plecoptera, Trichoptera and Odonata, standardized at the genus taxonomic level. We provide species names for 7,727,980 (93%) records without further taxonomic verification. Software format: The entire tab-separated value (.csv) database can be downloaded and visualized at https://glowabio.org/project/epto_database/. Fifty individual datasets are also available at https://fred.igb-berlin.de, while six datasets have restricted access. For the latter, we share metadata and the contact details of the authors.
Abstract Large consortia have revealed hundreds of genetic loci associated with anthropometric traits, one trait at a time. We examined whether genetic variants affect body shape as a composite phenotype that is represented by a combination of anthropometric traits. We developed an approach that calculates averaged PCs (AvPCs) representing body shape derived from six anthropometric traits (body mass index, height, weight, waist and hip circumference, waist-to-hip ratio). The first four AvPCs explain >99% of the variability, are heritable, and associate with cardiometabolic outcomes. We performed genome-wide association analyses for each body shape composite phenotype across 65 studies and meta-analysed summary statistics. We identify six novel loci: LEMD2 and CD47 for AvPC1, RPS6KA5/C14orf159 and GANAB for AvPC3, and ARL15 and ANP32 for AvPC4. Our findings highlight the value of using multiple traits to define complex phenotypes for discovery, which are not captured by single-trait analyses, and may shed light onto new pathways.
Abstract Quantifying the genetic correlation between cancers can provide important insights into the mechanisms driving cancer etiology. Using genome-wide association study summary statistics across six cancer types based on a total of 296,215 cases and 301,319 controls of European ancestry, here we estimate the pair-wise genetic correlations between breast, colorectal, head/neck, lung, ovary and prostate cancer, and between cancers and 38 other diseases. We observed statistically significant genetic correlations between lung and head/neck cancer (rg = 0.57, p = 4.6 × 10−8), breast and ovarian cancer (rg = 0.24, p = 7 × 10−5), breast and lung cancer (rg = 0.18, p =1.5 × 10−6) and breast and colorectal cancer (rg = 0.15, p = 1.1 × 10−4). We also found that multiple cancers are genetically correlated with non-cancer traits including smoking, psychiatric diseases and metabolic characteristics. Functional enrichment analysis revealed a significant excess contribution of conserved and regulatory regions to cancer heritability. Our comprehensive analysis of cross-cancer heritability suggests that solid tumors arising across tissues share in part a common germline genetic basis.
Abstract Few genome-wide association studies (GWAS) account for environmental exposures, like smoking, potentially impacting the overall trait variance when investigating the genetic contribution to obesity-related traits. Here, we use GWAS data from 51,080 current smokers and 190,178 nonsmokers (87% European descent) to identify loci influencing BMI and central adiposity, measured as waist circumference and waist-to-hip ratio both adjusted for BMI. We identify 23 novel genetic loci, and 9 loci with convincing evidence of gene-smoking interaction (GxSMK) on obesity-related traits. We show consistent direction of effect for all identified loci and significance for 18 novel and for 5 interaction loci in an independent study sample. These loci highlight novel biological functions, including response to oxidative stress, addictive behaviour, and regulatory functions emphasizing the importance of accounting for environment in genetic analyses. Our results suggest that tobacco smoking may alter the genetic susceptibility to overall adiposity and body fat distribution.
Abstract The breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas. Of the 8,597 genes evaluated, significant associations were identified for 48 at a Bonferroni-corrected threshold of P < 5.82 × 10−6, including 14 genes at loci not yet reported for breast cancer. We silenced 13 genes and showed an effect for 11 on cell proliferation and/or colony-forming efficiency. Our study provides new insights into breast cancer genetics and biology.
Abstract Motivation: The BioTIME database contains raw data on species identities and abundances in ecological assemblages through time. These data enable users to calculate temporal trends in biodiversity within and amongst assemblages using a broad range of metrics. BioTIME is being developed as a community‐led open‐source database of biodiversity time series. Our goal is to accelerate and facilitate quantitative analysis of temporal patterns of biodiversity in the Anthropocene. Main types of variables included: The database contains 8,777,413 species abundance records, from assemblages consistently sampled for a minimum of 2 years, which need not necessarily be consecutive. In addition, the database contains metadata relating to sampling methodology and contextual information about each record. Spatial location and grain: BioTIME is a global database of 547,161 unique sampling locations spanning the marine, freshwater and terrestrial realms. Grain size varies across datasets from 0.0000000158 km2 (158 cm2) to 100 km2 (1,000,000,000,000 cm2). Time period and grain: BioTIME records span from 1874 to 2016. The minimal temporal grain across all datasets in BioTIME is a year. Major taxa and level of measurement: BioTIME includes data from 44,440 species across the plant and animal kingdoms, ranging from plants, plankton and terrestrial invertebrates to small and large vertebrates. Software format: .csv and .SQL.
Abstract Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57–1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628–0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
Abstract Reduced lung function predicts mortality and is key to the diagnosis of chronic obstructive pulmonary disease (COPD). In a genome-wide association study in 400,102 individuals of European ancestry, we define 279 lung function signals, 139 of which are new. In combination, these variants strongly predict COPD in independent populations. Furthermore, the combined effect of these variants showed generalizability across smokers and never smokers, and across ancestral groups. We highlight biological pathways, known and potential drug targets for COPD and, in phenome-wide association studies, autoimmune-related and other pleiotropic effects of lung function–associated variants. This new genetic evidence has potential to improve future preventive and therapeutic strategies for COPD.
Abstract Nomenclatural type definitions are one of the most important concepts in biological nomenclature. Being physical objects that can be re-studied by other researchers, types permanently link taxonomy (an artificial agreement to classify biological diversity) with nomenclature (an artificial agreement to name biological diversity). Two proposals to amend the International Code of Nomenclature for algae, fungi, and plants (ICN), allowing DNA sequences alone (of any region and extent) to serve as types of taxon names for voucherless fungi (mainly putative taxa from environmental DNA sequences), have been submitted to be voted on at the 11th International Mycological Congress (Puerto Rico, July 2018). We consider various genetic processes affecting the distribution of alleles among taxa and find that alleles may not consistently and uniquely represent the species within which they are contained. Should the proposals be accepted, the meaning of nomenclatural types would change in a fundamental way from physical objects as sources of data to the data themselves. Such changes are conducive to irreproducible science, the potential typification on artefactual data, and massive creation of names with low information content, ultimately causing nomenclatural instability and unnecessary work for future researchers that would stall future explorations of fungal diversity. We conclude that the acceptance of DNA sequences alone as types of names of taxa, under the terms used in the current proposals, is unnecessary and would not solve the problem of naming putative taxa known only from DNA sequences in a scientifically defensible way. As an alternative, we highlight the use of formulas for naming putative taxa (candidate taxa) that do not require any modification of the ICN.
Corneal curvature, a highly heritable trait, is a key clinical endophenotype for myopia - a major cause of visual impairment and blindness in the world. Here we present a trans-ethnic meta-analysis of corneal curvature GWAS in 44,042 individuals of Caucasian and Asian with replication in 88,218 UK Biobank data. We identified 47 loci (of which 26 are novel), with population-specific signals as well as shared signals across ethnicities. Some identified variants showed precise scaling in corneal curvature and eye elongation (i.e. axial length) to maintain eyes in emmetropia (i.e. HDAC11/FBLN2 rs2630445, RBP3 rs11204213); others exhibited association with myopia with little pleiotropic effects on eye elongation. Implicated genes are involved in extracellular matrix organization, developmental process for body and eye, connective tissue cartilage and glycosylation protein activities. Our study provides insights into population-specific novel genes for corneal curvature, and their pleiotropic effect in regulating eye size or conferring susceptibility to myopia.
Lung-function impairment underlies chronic obstructive pulmonary disease (COPD) and predicts mortality. In the largest multi-ancestry genome-wide association meta-analysis of lung function to date, comprising 580,869 participants, we identified 1,020 independent association signals implicating 559 genes supported by ≥2 criteria from a systematic variant-to-gene mapping framework. These genes were enriched in 29 pathways. Individual variants showed heterogeneity across ancestries, age and smoking groups, and collectively as a genetic risk score showed strong association with COPD across ancestry groups. We undertook phenome-wide association studies for selected associated variants as well as trait and pathway-specific genetic risk scores to infer possible consequences of intervening in pathways underlying lung function. We highlight new putative causal variants, genes, proteins and pathways, including those targeted by existing drugs. These findings bring us closer to understanding the mechanisms underlying lung function and COPD, and should inform functional genomics experiments and potentially future COPD therapies.