GWAS using 2b-RAD sequencing identified three mastitis important SNPs via two-stage association analysis in Chinese Holstein cows

Fan Yang; Fanghui Chen; Lili Li; Li Yan; Tarig Badri; Chenglong Lv; Daolun Yu; Jie Chen; Chaofeng Xing; Jie Li; Genlin Wang; Honglin Li; Jun Li; Yafei Cai

doi:10.1101/434340

Abstract

Background Bovine mastitis is a key disease restricting the development of the global dairy industry. Genome-wide association studies (GWAS) provide a convenient way to understand the biological basis of mastitis and to better prevent or treat the disease. 2b-RADseq is a reduced-representation sequencing method that offers a powerful approach to genome-wide genetic marker development and genotyping. In this study, a GWAS with two-stage association analysis was used to identify single nucleotide polymorphisms (SNPs) in genes important for mastitis in Chinese Holstein cows.

Results In the selected Chinese Holstein population, we identified 10,058 SNPs and estimated their allele frequencies. In stage I, the Bayesian model screened out 42 significant SNPs (P<0.001), while the logistic regression model identified 51 SNPs (P<0.01). Twenty-seven significant SNPs appeared in both analytical models, of which only three (rs75762330, C>T, PIC=0.2999; rs88640083, A>G, PIC=0.1676; rs20438858, G>A, PIC=0.3366), all located in non-coding regions (introns and intergenic regions), were associated with inflammation or immune response. GO enrichment analysis showed that they were annotated to three genes (PTK2B, SYK and TNFRSF21), respectively. In stage II, a case-control study was used to verify the association of these three SNPs with mastitis traits in an independent population. The associations between the three SNPs (rs75762330, P<0.025; rs88640083, P<0.005; rs20438858, P<0.001) and mastitis traits in dairy cows were consistent with stage I.

Conclusion Two-stage association analysis confirmed that three significant SNPs are associated with mastitis traits in Chinese Holstein cows. Gene function analysis indicated that three genes (PTK2B, SYK and TNFRSF21) are involved in inflammation and the immune response of dairy cows, suggesting that they are new candidate genes affecting mastitis susceptibility (PTK2B and SYK, OR>1) or resistance (TNFRSF21, OR<1) in Chinese Holstein cows.

Background

Bovine mastitis is the most complex and costly disease of dairy cattle, with a high incidence that seriously affects the development of the dairy industry worldwide (MAUNSELL et al. 1998; SCHUKKEN et al. 2009; WELDERUFAEL et al. 2017). Mastitis causes direct economic losses in several ways, including a dramatic reduction in milk yield, treatment costs, and the condemnation of milk because of antibiotic or bacterial contamination. It also leads to higher culling rates and, occasionally, the death of milk-producing cows (SWINKELS et al. 2005; HALASA et al. 2007; HALASA et al. 2009; HOGEVEEN et al. 2011). Therefore, despite improvements in the breeding of disease-resistant cows, mastitis continues to be a notable challenge and a major profitability issue for dairy farmers. Previous studies reported that cow mastitis is a complex quantitative trait affected by multiple factors, including genetic features and pathogen infections (HERTL et al. 2014; MOOSAVI et al. 2014; USMAN et al. 2015; POKORSKA et al. 2016; KIKU et al. 2017). Bovine milk somatic cell count (SCC) and its log-transformed value (somatic cell score, SCS) are the primary traits for detecting mastitis and have high heritability (WANG et al. 2015). Thus, screening and identifying susceptibility or resistance genes associated with mastitis traits will improve dairy cow populations and help reduce the incidence of mastitis (SAHANA et al. 2014; KADRI et al. 2015; WANG et al. 2015). Different research strategies have been used successfully to identify significant genes associated with mastitis traits, including SNPs in candidate genes, quantitative trait loci (QTL) and GWAS (BRONDUM et al. 2015; POKORSKA et al. 2016; ZHANG et al. 2016).

GWAS provides a convenient way to understand the biological basis of disease and to improve prevention or treatment (VISSCHER et al. 2017). In the past decade, it has been used extensively to screen candidate gene mutations in order to improve population productivity and disease resistance traits (DAETWYLER et al. 2014; CRISPIM et al. 2015; SAOWAPHAK et al. 2017; SELIMOVIC-HAMZA et al. 2017; VARSHNEY et al. 2017). It is also widely regarded as a potential SNP-based molecular marker-assisted selection method for mastitis traits in dairy cattle (WIGGANS et al. 2009; WANG et al. 2015). At the chromosome level, GWAS data showed that Bos taurus autosomes (BTA) 2, 4, 6, 10, 14, 18 and 20 are associated with clinical mastitis and significantly correlated with somatic cell scores in cows (SODELAND et al. 2011; MEREDITH et al. 2012; WIJGA et al. 2012). GWAS has also emerged as one of the primary strategies for finding genetic variations associated with traits, and many genetic associations have been determined for a wide variety of prevalent, complex diseases, as described in the GWAS list (HINDORFF et al. 2009). Sahana and colleagues reported two clinical mastitis candidate genes (vitamin D-binding protein precursor, GC, and neuropeptide FF receptor 2, NPFFR2) using a high-density SNP array and GWAS (SAHANA et al. 2014). These two candidate genes were again found to be associated with mastitis traits in dairy cows through genomic sequencing in 2016 (ZHANG et al. 2016). In 2015, Wang et al. identified another two mastitis susceptibility genes (TRAPPC9 and ARHGAP39) in Chinese Holstein (WANG et al. 2015), whereas Wu et al. detected five mastitis susceptibility genes (NPFFR2, SLC4A4, DCK, LIFR and EDN3) in Danish Holsteins (WU et al. 2015). Genetic variations in genes related to the immune response to specific pathogens (LY75, DPP4, ITGB6 and NR4A2) and in lymphocyte antigen-6 complex genes (LY6K, LY6D, LYNX1, LYPD2, SLURP1 and PSCA) might lead to clinical mastitis in American Holstein cows (TIEZZI et al. 2015). Additionally, single gene polymorphisms (CXCR1, MAP4K4) and their signaling pathways (TLR4/NF-κB) have served as genetic markers for mastitis in different cow populations (POKORSKA et al. 2016; BHATTARAI et al. 2017). These results suggest that the genetic variations or polymorphisms associated with mastitis traits are inconsistent across populations and should be screened and validated in each population.

2b-RADseq, a simplified and flexible restriction site-associated DNA (RAD) genotyping method based on type IIB restriction endonucleases, provides a powerful way to identify SNPs across the genome of a population. It offers strong technical repeatability, uniform sequencing depth, high cost-effectiveness and good genome coverage (WANG et al. 2012; GUO et al. 2014). Furthermore, the 2b-RADseq technique has successfully predicted multilocus sequence typing (MLST) results and provides more detailed population information than the MLST technique. It therefore offers a cost-effective and time-saving analysis strategy for large-scale studies in molecular epidemiology, public hygiene, systematic bacterial genetics, population genetics and bio-safety (PAULETTO et al. 2016; HERNANDEZ-CASTRO et al. 2017). The method is also suitable for constructing high-density genetic or linkage maps of genomic regions or locus markers and for revealing the regions associated with traits by QTL mapping and association analysis (JIAO et al. 2014; ZHAO et al. 2017). More importantly, 2b-RAD can yield many SNPs through deep sequencing of relatively few samples, from which candidate genes related to traits can then be identified (LUO et al. 2017). Therefore, 2b-RAD may be an ideal genotyping platform for screening mastitis resistance or susceptibility genes in dairy cattle.

In this study, in order to identify mastitis susceptibility or resistance SNPs in Chinese Holstein and to better understand the genetic and biological pathways of mastitis, we carried out the following: 1) 2b-RAD sequencing of the dairy cattle genome; 2) SNP identification and RADtyping; 3) GWAS of the significant SNPs associated with mastitis traits via logistic regression analysis models; 4) a case-control study to validate the significant SNPs in an independent dairy population. We then identified mastitis susceptibility or resistance SNPs and evaluated the potential value of their associated genes for traits of Chinese Holstein cows.

Methods

Sample libraries and preparation

The experimental Chinese Holstein cows came from two different pastures of the same dairy company (Nanjing Weigang Dairy Co., Ltd.). Forty dairy cows were selected from 596 lactating Chinese Holstein cows and divided into two subgroups according to their clinical mastitis phenotypes: a case group (20 cows) and a control group (20 cows). A further 383 cows were screened from 886 lactating cows in another pasture, with 73 in the case group and 310 in the control group. Within each pasture, all animals had the same growth and feeding environment, similar production levels, and equivalent parity and lactation period.

Blood was sampled from the tail vein to minimize harm to the cows. First, the cows were restrained in stanchions with their tails exposed outside the frame. Second, the blood collector grasped the cow's tail and lifted it upwards, then sterilized the depression at the midpoint between the 4th and 5th tail vertebrae with an alcohol cotton ball. The needle of the blood collection tube was then inserted vertically into the tail vein to draw blood. Finally, after the blood was drawn, the puncture site was pressed with a cotton ball for 30 seconds to stop the bleeding, and the cow was released. Genomic DNA was extracted from whole blood using the TIANamp Genomic DNA Kit. The quality of the genomic DNA was assessed by NanoDrop and agarose gel electrophoresis (3 microliters of genomic DNA were loaded on a 1% agarose gel, run at a constant voltage of 100 V for 25 minutes, then viewed under ultraviolet light and photographed).

2b-RAD library and sequencing

Forty sample libraries were prepared following the 2b-RAD sequencing protocol, with slight modifications, together with the five-tag tandem technique (RUBIN et al. 2010; WANG et al. 2012). The Bos taurus genome (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/055/GCF_000003055.6_Bos_taurus_UMD_3.1.1/GCF_000003055.6_Bos_taurus_UMD_3.1.1_genomic.fna.gz) was used as the reference for in silico prediction of enzyme digestion of genomic DNA. On this basis, the BaeI enzyme was selected to digest genomic DNA. The digested DNA fragment tags of each sample were ligated to standard 5'-NNN-3' adaptors. Paired-end sequencing was carried out on the Illumina HiSeq X Ten (https://support.illumina.com/downloads/sequencing-analysis-viewer-software-v2-18.html) platform after the library passed quality control. The library was constructed following WANG et al. (WANG et al. 2016a) (Figure S1), and the construction steps were: (1) Enzymatic digestion: ≥ 200 ng genomic DNA was digested by the type IIB restriction endonuclease; (2) Adaptor ligation: 5 different sets of adaptors were added to the digestion products and ligated with T4 DNA ligase; (3) Amplification: the ligation products were amplified by PCR; (4) Concatenation: according to the information of the 5 sets of adaptors, the five tags were connected in series; (5) Pooling: barcode sequences were added to the concatenated products and the libraries were mixed; (6) Sequencing: libraries that passed quality control were sequenced.

Raw Reads quality control

The original sequencing data (Raw Data or Raw Reads) were obtained on the Illumina HiSeq sequencing platform. The Phred value is a function of the base-calling error rate (Figure S2) and was obtained from a probability model that predicts base-calling errors. The calculation formula was: Q_Phred = −10 log₁₀(e), where e is the base-calling error probability (Table S1). Reads containing adaptor sequences and reads with an N-base ratio ≥8% were then removed to obtain Clean Reads, which were merged using Pear (Version 0.9.6) software (http://pear.php.net/package/HTTP_WebDAV_Client/download/0.9.6/). After the reads were assigned to each sample, high-quality Enzyme Reads containing the enzyme recognition site were extracted. SOAP (version 2.21) (http://soap.genomics.org.cn/soapaligner.html#down2) (Short Oligonucleotide Analysis Package) software was used to align the Enzyme Reads to the reference sequence (-r 0 keeps only unique alignments; -M 4 selects the best-hit alignment mode; -v 2 allows up to two mismatches). Identical reads were clustered into unique tags, and the number of reads in each cluster gave the sequencing depth of the tag.
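The Phred relationship above can be illustrated with a small sketch. The function names are hypothetical and are not part of the authors' pipeline; the code only restates Q = −10 log₁₀(e) and its inverse.

```python
import math


def phred_from_error_prob(p_error: float) -> float:
    """Phred quality score Q for a base-calling error probability e."""
    if not 0.0 < p_error <= 1.0:
        raise ValueError("p_error must be in (0, 1]")
    return -10.0 * math.log10(p_error)


def error_prob_from_phred(q: float) -> float:
    """Inverse mapping: error probability implied by a Phred score Q."""
    return 10.0 ** (-q / 10.0)
```

Under this definition, a score of 20 corresponds to one expected error in 100 base calls and a score of 30 to one in 1,000.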

SNPs genotyping and Linkage Disequilibrium (LD) analysis

SNP marker typing (RADtyping) was performed on the Enzyme Reads using the maximum likelihood (ML) method in SOAP software. Based on the SNP typing results, differences in SNPs between samples were examined by cluster analysis with an R package. SNPs were annotated using SnpEff software (Version: 4.1g) (http://snpeff.sourceforge.net/) to determine where each SNP was located in a gene and whether it changed an amino acid.

Plink software was used to calculate the r² value of pairwise SNPs, with the main parameters set to: --r2 --ld-window-kb 1000 --ld-window 50 --ld-window-r2 0.2. The function F(x) = 1/(log₁₀((x+10^(7−C))/10⁷) + C) was fitted to the median values output by the software, and the fitted curves were plotted by chromosome or group. To find LD blocks in the case and control groups, we added the parameters “block output GAB-pair wise Tagging” when running the R package. Based on the 2b-RAD sequencing results, the optimal maximum distance for each LD block was determined.
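As an illustration of the pairwise r² statistic, the sketch below computes the squared Pearson correlation between the allele counts (0, 1 or 2) of two SNPs. This is an assumption about how r² is obtained from unphased genotypes; the authors' exact Plink version and internal estimator are not stated, and the function name is hypothetical.

```python
def genotype_r2(g1, g2):
    """Squared Pearson correlation between two SNPs coded as 0/1/2 allele counts.

    Missing calls are given as None and dropped pairwise.
    """
    pairs = [(a, b) for a, b in zip(g1, g2) if a is not None and b is not None]
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two individuals genotyped at both SNPs")
    mean_a = sum(a for a, _ in pairs) / n
    mean_b = sum(b for _, b in pairs) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in pairs)
    var_a = sum((a - mean_a) ** 2 for a, _ in pairs)
    var_b = sum((b - mean_b) ** 2 for _, b in pairs)
    if var_a == 0 or var_b == 0:
        return float("nan")  # a monomorphic SNP has no defined correlation
    return (cov * cov) / (var_a * var_b)
```

Averaging such values over SNP pairs binned by physical distance gives the points to which a decay curve such as F(x) can be fitted.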

Population structure and genetic diversity

After the samples were processed by 2b-RAD sequencing, principal component (PC) analysis was used to evaluate the population structure. An association test was then performed for each subgroup, with the first five PCs included as covariates to adjust for population stratification. The detailed PC data are shown in Table S3. The first two eigenvectors were used to plot the relationships among samples. As shown in Figure S5 (a), the case and control groups overlapped and no outliers were detected, which is consistent with the sample collection design. To assess the genetic diversity of the experimental samples, polymorphic information content (PIC), observed heterozygosity (Ho) and expected heterozygosity (He) values were also calculated for each SNP site. In addition, the genetic differentiation coefficient between the subgroups was calculated.
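The diversity statistics named above can be sketched as follows, assuming the standard definitions He = 1 − Σpᵢ² and PIC = 1 − Σpᵢ² − ΣΣ 2pᵢ²pⱼ² (i < j). The software the authors used for these values is not stated, and the function names are illustrative only.

```python
def expected_heterozygosity(freqs):
    """He = 1 - sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def polymorphic_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    cross = 0.0
    for i in range(len(freqs)):
        for j in range(i + 1, len(freqs)):
            cross += 2.0 * freqs[i] ** 2 * freqs[j] ** 2
    return 1.0 - homozygosity - cross


def observed_heterozygosity(genotypes):
    """Ho = share of called genotypes (0/1/2 allele counts) that are heterozygous."""
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no called genotypes")
    return sum(1 for g in called if g == 1) / len(called)
```

For a biallelic SNP these formulas give a maximum PIC of 0.375 at allele frequencies of 0.5, so biallelic markers can reach at most the moderate polymorphism class (0.25 to 0.5).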

SNPs and mastitis traits association analysis

We performed a GWAS treating dairy cow mastitis as a qualitative (case-control) trait, using the 2b-RAD marker genotype of each individual at each SNP locus. To ensure the accuracy of the analysis, we used a multi-stage GWAS to identify important SNPs associated with mastitis traits. Stage 1: we selected Chinese Holstein cows with clinical mastitis and normal controls for GWAS. We assumed that each genomic region contains a true SNP associated with mastitis. We calculated the association statistic for each SNP site and selected the most strongly associated SNPs as the candidate causal SNPs. If a region contains a single causal SNP, then the most significantly associated SNP is most likely the causal one. GWAS analysis was performed using Bayesian and logistic regression models to compare SNPs between cases and controls and to identify significant SNPs. Quantile-quantile plots (QQ-plots) were used to evaluate the suitability of the two analysis models. GO enrichment analysis was performed on all genes containing SNPs, and their functions were described in conjunction with the GO annotations. A hypergeometric distribution test (Cytoscape software) was used to calculate the significance of gene enrichment in each GO term. Stage 2: the important SNPs screened in stage 1 were validated in another, independent Chinese Holstein dairy population.

Statistical model

Principal component analysis (PCA)

PCA is a dimensionality reduction method that condenses many original variables into a few factors with minimal information loss (LI et al. 2017). In this experiment, let F₁ denote the first principal component, formed by the first linear combination of the original variables, F₁ = a₁₁X₁ + a₁₂X₂ + … + a_1mX_m (m is the number of original variables). The information captured by each principal component is measured by its variance: the larger the variance, the more information F contains. If the first principal component is not enough to represent the initial m variables, a second component F₂ is selected. Information already contained in F₁ should not appear again in F₂; that is, F₂ and F₁ should be uncorrelated, which is expressed by a covariance of zero. F₁, F₂, …, F_n are built in the same way, as in equation (1).
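Equation (1) is not reproduced in this copy. A form consistent with the description above, given here as a reconstruction under that assumption and not as the authors' exact notation, would be:

```latex
\begin{equation}
\begin{aligned}
F_1 &= a_{11}X_1 + a_{12}X_2 + \cdots + a_{1m}X_m \\
F_2 &= a_{21}X_1 + a_{22}X_2 + \cdots + a_{2m}X_m \\
    &\;\;\vdots \\
F_n &= a_{n1}X_1 + a_{n2}X_2 + \cdots + a_{nm}X_m
\end{aligned}
\qquad
\operatorname{Cov}(F_i, F_j) = 0 \ (i \neq j), \quad
\operatorname{Var}(F_1) \ge \operatorname{Var}(F_2) \ge \cdots \ge \operatorname{Var}(F_n)
\end{equation}
```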

Bayesian and Logistic regression model association Analysis

Linear models are a common method for association analysis of phenotypes and genotypes. Strict quality control was used to remove poorly performing SNP marker loci in RADtyping. Bayesian and logistic regression models were used in the GWAS to detect SNPs associated with clinical mastitis in dairy cows. First, the following linear regression equation was built based on phenotype (GUO et al. 2018):
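The regression equation itself is missing from this copy. A form consistent with the symbol definitions that follow, offered as a reconstruction under that assumption, would be:

```latex
\begin{equation}
y_i = \mu + \sum_{k=1}^{M} X_{ik}\,\alpha_k + e_i
\end{equation}
```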

Where y_i is the phenotype of individual i; M is the number of SNPs; μ is the overall mean of the trait phenotypes; α_k is the additive association effect of the kth SNP; X_ik is the genotype (0, 1, or 2) of the kth SNP observed in the ith individual; and e is the residual effect.

The Bayesian model assumed a normal prior distribution for the SNP effects. First, we considered the possibility that each SNP locus is truly associated with the mastitis phenotype in the GWAS by selecting a value π for the prior probability of H₁. The prior belief in an association between a SNP and dairy mastitis traits was quantified by π, and a specific π value (10⁻⁴ to 10⁻⁶) was used as the prior estimate that a SNP is truly associated with the cow mastitis trait. The probability of H₀ was accordingly (1−π). Second, the Bayes factor was calculated for each SNP. The Bayes factor (BF) is the ratio between the probability of the data under H₁ and under H₀. The null hypothesis H₀ is θ_het = θ_hom = 0. Under H₁, at least one of θ_het = t₁ and θ_hom = t₂ is non-zero (WELLCOME TRUST CASE CONTROL et al. 2012).

Where θ_het is the logarithm of the odds ratio (OR) between the heterozygote and the common homozygote, and θ_hom is the logarithm of the OR between the rare and common homozygotes. The posterior odds (PO) in favor of H₁ were then calculated as PO = BF×π/(1−π). The posterior probability of association, PPA = PO/(1+PO), can be regarded as a Bayesian analogue of the P value.
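The two formulas above can be evaluated directly. The sketch below is a plain restatement of PO = BF×π/(1−π) and PPA = PO/(1+PO); the function names are hypothetical and the Bayes factor is assumed to have been computed elsewhere.

```python
def posterior_odds(bayes_factor: float, prior_h1: float) -> float:
    """PO = BF * pi / (1 - pi), where pi is the prior probability of H1."""
    if not 0.0 < prior_h1 < 1.0:
        raise ValueError("prior_h1 must lie strictly between 0 and 1")
    if bayes_factor < 0.0:
        raise ValueError("bayes_factor must be non-negative")
    return bayes_factor * prior_h1 / (1.0 - prior_h1)


def posterior_probability_of_association(bayes_factor: float, prior_h1: float) -> float:
    """PPA = PO / (1 + PO)."""
    po = posterior_odds(bayes_factor, prior_h1)
    return po / (1.0 + po)
```

With a prior of 10⁻⁴, for example, a Bayes factor of about 10⁴ is needed before the posterior probability of association reaches roughly one half, which shows how strongly the choice of π shapes the result.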

The variances of the SNP effects were assumed to be independent and identically distributed (IID), each following a scaled inverse chi-square prior, where v is the degrees-of-freedom parameter and S² is the scale parameter:

The resulting marginal prior distribution of each SNP effect was a t-distribution (MEUWISSEN et al. 2001; GUO et al. 2018):

The prior for α_k depends on the variance of each SNP, and each variance has an inverse chi-square prior. A SNP has a null effect with probability π or a normally distributed effect with probability (1−π) (GIANOLA 2013):

Where the variance term represents the common variance of all non-zero SNP effects and is assigned a scaled inverse chi-square prior. The unknown π in the model was estimated from its prior distribution, which was taken as uniform between 0 and 1, i.e. π ~ Uniform(0, 1).

v_a was set to 4, and the scale parameter was calculated from the additive variance.

Where P_k is the allele frequency of the kth SNP, and the remaining terms are the variance of a given marker and the additive genetic variance explained by the SNPs.

Then, assuming that SNPs affect the mastitis phenotype, we constructed a logistic regression equation to detect SNPs associated with clinical mastitis in dairy cows. The fitted logistic regression equation was (BISCARINI et al. 2016; WANG et al. 2016b):
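The fitted equation is missing from this copy. Assuming a single-SNP logistic model, which matches the symbol definitions that follow, it would take the form:

```latex
\begin{equation}
\log\frac{P_j}{1 - P_j} = \mu + \beta_j X_{ij}
\end{equation}
```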

Where P_j is the probability that the clinical phenotype occurs given X_ij; (1−P_j) is the probability that the phenotype does not occur; X_ij = (X_1j, X_2j, X_3j, …, X_mj) is the genotype of individual i at position j (0, 1, or 2); β_j is the effect of the jth SNP; m is the number of samples; and μ is the overall mean of the trait phenotypes.

In the logistic regression model, Y = μ+Σβ_iX_i = log(P/(1−P)), and the equation can be transformed into another form:

Where P is the probability of the clinical mastitis phenotype, X_i is the genotype of individual i, and β_i is the log odds ratio, so that OR = exp(β_i). The expression relating P to the variable X_i can be derived by transforming the equation:

The greater the absolute value of β_i, the greater the influence of X_i on Y. The 95% confidence interval of the OR is CI = exp(β_i ± 1.96 SE(β_i)).
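To make the link between the regression coefficient and the reported OR concrete, the sketch below applies OR = exp(β) and the confidence interval formula quoted above, together with the inverse logit that maps Y back to a probability. Function names are hypothetical; the coefficient and its standard error are assumed to come from a fitted model.

```python
import math


def inverse_logit(y: float) -> float:
    """Probability P implied by the linear predictor Y = log(P / (1 - P))."""
    return 1.0 / (1.0 + math.exp(-y))


def odds_ratio_with_ci(beta: float, se: float, z: float = 1.96):
    """Return (OR, lower, upper) with OR = exp(beta) and CI = exp(beta +/- z * SE)."""
    if se < 0.0:
        raise ValueError("standard error must be non-negative")
    return (
        math.exp(beta),
        math.exp(beta - z * se),
        math.exp(beta + z * se),
    )
```

A negative coefficient gives OR < 1 (a protective allele) and a positive one gives OR > 1 (a risk allele); the association is conventionally considered significant at the 5% level when the interval excludes 1.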

Case-control population verification analysis

We determined the number of validation samples using a matched design with unequal numbers of cases and controls (case/control = 1/h).

Where n is the number of cows with clinical mastitis; N is the total number of cows in the verification population; P₀ is the exposure rate of the SNP in the control group; P₁ is the exposure rate of the SNP in the case group; OR is the odds ratio of exposure; α is the probability of a type I error in hypothesis testing; β is the probability of a type II error, and (1−β) is the expected statistical power. OR 95% CI is the 95% confidence interval of the OR.

The attributable fraction reflects the probability that a case randomly selected from the population is due to the SNP.

Where I_e is the incidence in the group carrying the mutation at the site and I₀ is the incidence in the group without the mutation. Incidence is generally not available in case-control studies, and only the OR can be obtained. AF_e refers to the proportion of mastitis among cows carrying the mutation that is attributable to the SNP.
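Because only the OR is available in a case-control design, a common approximation substitutes the OR for the relative risk in AF_e = (I_e − I₀)/I_e, giving AF_e ≈ (OR − 1)/OR. The sketch below assumes this approximation; whether it is exactly the formula the authors applied is not confirmed by the text, and the function name is hypothetical.

```python
def attributable_fraction_exposed(odds_ratio: float) -> float:
    """AF_e approximated as (OR - 1) / OR, using OR in place of the relative risk."""
    if odds_ratio <= 0.0:
        raise ValueError("odds_ratio must be positive")
    return (odds_ratio - 1.0) / odds_ratio
```

An OR above 1 yields a positive fraction (excess risk among carriers), an OR of 1 yields zero, and an OR below 1 yields a negative value, which is read as a protective effect; for example, OR = 0.359 gives about −1.786.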

Where AF_p indicates the proportion of all mastitis in the population that is caused by the SNP. I_p is the total incidence of mastitis in Chinese Holstein cows. I₀ is the incidence of mastitis in cows without the mutation at the SNP locus. P_e is the mutation rate of the SNP locus in the control group.

Results

Restriction endonuclease digestion and unique tags statistics

In this project, the BaeI restriction endonuclease was used to digest genomic DNA of Chinese Holstein cows. Identical reads were clustered to obtain the unique tags of each sample, and the sequencing depth of each tag was calculated. After removing tags with a sequencing depth of less than 3×, the average number of tags per sample was 198,948 and the average sequencing depth was 17.43× (Figure 1, third ring). The average spacing between tags was about 9589 bp (Figure S3), and the unique tag alignment ratio across all samples was 59.69% to 72.71%.

Figure 1

Circos plots of the chromosomal distribution of all SNPs and a summary of sequencing depth. (a): Bayesian model analysis of SNPs for the qualitative trait in all samples (from outer to inner). The outermost ring is the chromosome scale; the second ring shows the differential SNPs for all samples; the third ring shows the average sequencing depth of the 40 samples at each locus (pink; depths over 50 are plotted as 50) and the average sequencing depth within a 1 M window (black line; depths over 50 are plotted as 50); the fourth ring shows the SNPs with chi-square P<0.05 for the qualitative trait. (b): Logistic regression analysis of SNPs for the qualitative trait in all samples. The first to third rings are the same as in (a). The fourth ring shows the result of logistic regression analysis of differential SNPs (P<0.05).

SNPs RADtyping and genotype

After RADtyping and filtering, 10,058 SNPs (Figure 1, second ring) were screened out across all samples. The distribution of SNPs on the chromosomes of each sample was summarized with sliding-window statistics (Figure S4). We then counted the allele frequencies of all SNP loci. The Cochran-Armitage test of association between single-SNP genotypes and case-control status is shown in Figure S5(b). Because the heterozygote risk was intermediate between those of the two homozygotes, the trend line fitted the data reasonably well, consistent with an additive genotype risk. No deviation was observed in this case, so the test was considered reliable. Details are given in Table S2. We also calculated the genetic differentiation coefficient (Fst = 0.01869) between the two groups.
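For readers unfamiliar with the Cochran-Armitage trend test, a minimal sketch is given below. It assumes the usual additive scores (0, 1, 2) for the three genotypes and a 1-degree-of-freedom chi-square approximation; the function name and the example counts in the usage note are hypothetical and are not taken from Table S2.

```python
import math


def cochran_armitage_trend(cases, controls, weights=(0, 1, 2)):
    """Cochran-Armitage trend test on a 2 x k genotype table.

    cases, controls: genotype counts in the same order as `weights`.
    Returns (chi-square statistic with 1 df, p-value).
    """
    if not (len(cases) == len(controls) == len(weights)):
        raise ValueError("cases, controls and weights must have equal length")
    col_totals = [r + s for r, s in zip(cases, controls)]
    total = sum(col_totals)
    p_case = sum(cases) / total
    # Score statistic: weighted deviation of case counts from expectation.
    u = sum(w * (r - n * p_case) for w, r, n in zip(weights, cases, col_totals))
    sum_w2n = sum(w * w * n for w, n in zip(weights, col_totals))
    sum_wn = sum(w * n for w, n in zip(weights, col_totals))
    variance = p_case * (1.0 - p_case) * (sum_w2n - sum_wn ** 2 / total)
    if variance <= 0.0:
        raise ValueError("degenerate table: trend variance is zero")
    chi2 = u * u / variance
    # Survival function of a chi-square with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

Calling `cochran_armitage_trend((5, 10, 25), (25, 10, 5))` on such illustrative counts returns a large statistic and a very small p-value, because the share of cases rises steadily with the number of risk alleles.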

Fst ranges from 0 to 1. The observed Fst value was close to 0, which indicated that there was little genetic differentiation between the two groups.
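The interpretation of Fst can be illustrated for a single biallelic locus with Wright's definition Fst = (H_T − H_S)/H_T. This is only a sketch under that assumption: the estimator the authors used to obtain 0.01869 is not stated, and the function name is hypothetical.

```python
def fst_two_groups(p1: float, p2: float, n1: int = 1, n2: int = 1) -> float:
    """Wright's Fst = (H_T - H_S) / H_T for one biallelic locus in two groups.

    p1, p2: frequency of the same allele in each group.
    n1, n2: group sizes used as weights (equal weights by default).
    """
    w1 = n1 / (n1 + n2)
    w2 = n2 / (n1 + n2)
    p_bar = w1 * p1 + w2 * p2
    h_total = 2.0 * p_bar * (1.0 - p_bar)  # expected heterozygosity, pooled
    if h_total == 0.0:
        return 0.0  # locus is monomorphic overall, no differentiation
    h_within = w1 * 2.0 * p1 * (1.0 - p1) + w2 * 2.0 * p2 * (1.0 - p2)
    return (h_total - h_within) / h_total
```

Identical allele frequencies in the two groups give 0, and alternatively fixed alleles give 1, which matches the 0 to 1 range described above.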

Genome Wide Association Analysis

The SNP association statistics follow a multivariate normal distribution. We also calculated the likelihood of the possible causal states of the SNPs. Each SNP has two possible causal states: effect or no effect. Therefore, for n SNPs, the likelihoods of 2ⁿ possible causal configurations need to be considered. For each of these states, a multivariate normal distribution was used to calculate the probability of the data given that causal state. Thus, identifying the best SNP set requires a large amount of computation. The data showed that the two models gave slightly different results for the qualitative trait across all SNPs. Bayesian analysis screened out 42 significant SNPs at P<0.001 (Table S4), while the logistic regression model identified 51 SNPs at P<0.01 (Table S5). Under these P-value thresholds, 27 significant SNP sites appeared in both analytical models (Table 1). As expected, the significant SNPs screened by the two models under their respective thresholds differed. Quantile-quantile plots (QQ-plots) were used to evaluate the suitability of the two statistical models. Figure 2 shows that the observed P values were consistent with the expected values at most SNP sites, indicating that the two analysis models were reasonable. In the upper-right corner of Figure 2(a), there are highly significant candidate sites potentially associated with mastitis traits. However, in Figure 2(b), SNPs significantly associated with mastitis traits were not apparent. This might be because cow mastitis is controlled by many genes of small effect, because the gene effects are too weak, or because the sequenced population we selected was too small.

Figure 2

Q-Q plots (quantile-quantile plots) showing the consistency of the observed and expected SNP P values. (a) and (b) show the observed versus expected −log10 (P) values of SNPs for the Bayesian and logistic regression analyses, respectively. (a) Some SNP P values exceeded the expected values, which suggested that these loci might be significantly associated with mastitis traits in dairy cows. (b) The observed P values were almost the same as the expected values, indicating that the analysis model was reasonable.

In the Bayesian analysis model (Table S4), the OR and 95% CI values of SNP rs21068792 were all “Na”, that is, missing values. The U95 values of 6 SNPs (rs98302192, rs49124945, rs57070376, rs13685463, rs57506421 and rs58979699) were “Nan”, which means not a number. The OR values of 11 SNPs (rs114843903, rs38937721, rs5881560, rs17514753, rs17518215, rs22015301, rs77887746, rs9704351, rs20438858, rs26414259 and rs50888452) were less than 1, indicating that these SNPs were protective factors for the related phenotypes. The OR values of the other 24 SNPs were greater than 1, indicating that these SNPs were risk factors for the related phenotypes. In the logistic regression model, 51 SNPs were significant at P<0.01 (Table S5). A total of 22 SNPs had OR values less than 1, and the other 29 SNPs had OR values greater than 1. We noticed that 8 SNPs (rs114843903, rs5881560, rs17514753, rs17518215, rs22015301, rs9704351, rs20438858 and rs50888452) had OR values <1 in both the logistic regression and the Bayesian models. Table 1 also shows that 19 SNPs had OR values greater than 1 in both models.

SNPs GO annotations

We annotated all 27 significant SNPs to determine their location in the genome. Table 2 shows that 14 SNPs were located in intergenic regions, 10 in introns, and one each in the 3’-UTR, upstream and downstream regions. Except for the rs33866959 site (A>T, transversion), all other sites were transitions. The PIC value of rs88640083 was less than 0.25 (low polymorphism), while all the others were between 0.25 and 0.5 (moderate polymorphism). GO enrichment of the 27 significant SNPs revealed that only 3 SNPs (rs75762330 (C>T, PIC= 0.2999), rs88640083 (A>G, PIC= 0.1676) and rs20438858 (G>A, PIC= 0.3366)) were associated with immune function (Table 3, Figure 3). SNP rs75762330 (C>T, OR>1, PIC= 0.2999>0.25) lies in the PTK2B gene on BTA 8 and is moderately polymorphic. PTK2B, also called Pyk2, regulates humoral responses and cell homeostasis (RACIOPPI et al. 2012; KREMER et al. 2014; RHEE et al. 2014; LLEWELLYN et al. 2017). SNP rs88640083 (A>G, OR>1, PIC=0.1676<0.25) lies in an intergenic region near the SYK gene on BTA 8 and shows low polymorphism. SYK is a non-receptor tyrosine kinase that is considered an important regulator of adaptive immunity and plays a vital role in the TLR4 signaling pathway (CHOI et al. 2015; SCHWEIGHOFFER et al. 2017). SNP rs20438858 (G>A, OR<1, PIC= 0.3366>0.25) lies in TNFRSF21 on BTA 23 and is moderately polymorphic. TNFRSF21, also known as death receptor 6 (DR6), is a member of the TNF/TNFR family and plays a critical role in immune response and inflammation (LOCKSLEY et al. 2001; STRILIC et al. 2016; FUJIKURA et al. 2017). The other 24 SNPs were statistically significant in both analytical models, but GO annotation showed that they were not related to inflammation or immune response. Combining the two models across all SNPs supported an association between these 3 significant SNPs and the risk of mastitis in dairy cows based on conventionally accepted genome-wide statistical significance thresholds.

Figure 3

Chromosome mapping of the genes associated with the three significant SNPs in Chinese Holstein. Manhattan plots (a) and (b) show the SNPs associated with mastitis in Chinese Holstein screened by the two models, respectively. (a) is the result of the Bayesian analysis, while (b) shows the related genes labeled by the logistic analysis model. Red dots represent the chromosome locations of the associated genes. (c-e) are partial LD blocks of the three significant SNPs, respectively, with a distance interval of 1Mb; the redder the LD block color, the stronger the correlation between the dots. SNPs rs75762330 and rs77816736; rs88640083, rs85927029 and rs85635916; and rs20438858, rs19736020 and rs16711445 were in the same LD blocks (black circles), respectively, suggesting a potential association between their corresponding genes.

Correlation between SNPs

LD coefficients were calculated between pairs of SNP markers in the genome and then grouped according to the distance between the markers (Figure 4). Finally, the average LD coefficient between markers at a given distance was computed. The average LD coefficient at 100 kb in the case and control groups of Chinese Holstein cows was about 0.5, and the corresponding LD coefficient was still above 0.3 at a distance of 1000 kb. Moreover, the figure shows that the LD decay rate and the C value were the same in the case and control groups. We also noticed that LD decayed very slowly. The 2b-RAD data show that our SNP markers are sparse (average spacing 9589 bp). Therefore, a suitable LD block map was obtained when the classification interval size was set to 10Mb.

Figure 4

The linkage disequilibrium decay curve of case-control SNPs. The lines of different colors represent different populations/chromosomes, the horizontal axis is the physical distance between SNP pairs, and the vertical axis is the average r² value of marker pairs at the same physical distance. As the distance between sites increases, r² usually shows a decreasing trend. The larger the C value, the lower the probability of recombination between SNPs and LD attenuation distance; the smaller the C value, the higher the probability of recombination between SNPs and LD attenuation distance.

Figure 3(c) shows that rs75762330 was associated with rs77816736; however, the P-value of the latter was >0.05 in both analytical models (Table S6). SNP rs88640083 was associated with rs85927029 and rs85635916 (Figure 3(d)), yet the latter two were not statistically significant. SNPs rs19736020 and rs16711445 were associated with rs20438858 (Figure 3(e)), and Table S6 shows that the first two were not statistically significant. We also calculated the linkage disequilibrium (LD) between the three significant SNPs. Genetic linkage analysis showed that SNP rs75762330 was not correlated with rs88640083 (r²= 0.0022) or rs20438858 (r²= 0.043). However, rs20438858 was weakly correlated with rs88640083 (r²= 0.22).

Three significant SNPs population verification

Association analysis of the three important SNPs was performed in another, larger independent Chinese Holstein dairy population via direct sequencing (Figure 5, Table 4). We successfully PCR-amplified the regions near the three important SNPs and then sequenced them directly. The P-values of all three loci were <0.05, indicating statistically significant associations with mastitis in Chinese Holstein cows. The association between rs20438858 and risk of mastitis remained statistically significant, with an adjusted allele OR = 0.359 (OR <1). The other two significant SNPs (rs75762330 and rs88640083), located on BTA 8, had adjusted allele ORs of 2.416 and 1.879 (OR >1), respectively. Table 4 also shows that base G at rs88640083 was more frequent in the case group than in the control group, as was base T at rs75762330. In contrast, at rs20438858, base T was less frequent in the case group than in the control group. We also noted that the AFe values for rs75762330 and rs88640083 were 0.2489 and 0.2426 (>0), respectively, while the AFe value for rs20438858 was −1.786 (<0).

Three significant SNPs annotated to three candidate genes

Figure 5

Three SNPs were directly sequenced in an independent validation population of Chinese Holstein cows. (a) Gel electrophoresis of PCR-amplified fragments near the three significant SNPs; A-C are the PCR-amplified fragments of the rs88640083, rs75762330 and rs20438858 regions, respectively. (b-d) Direct sequencing results of the PCR amplification products near the three important SNPs, aligned with the reference sequences (ref: reference sequence; 1: heterozygous sequence; 2: variant sequence). The purple boxes mark the positions of the three SNPs. X, M and N represent the heterozygous types of the three SNPs, respectively.

GO enrichment analysis indicated that the three important genes are associated with adaptive and innate immune responses in Chinese Holstein cows (Figure S6). Figure 6 also shows that these three candidate genes directly or indirectly affect the function of AKT1 and promote the expression of pro-inflammatory cytokines in mammary epithelial cells and macrophages of dairy cows, suggesting that they are involved in polarization-related biological functions of mammary epithelial cells and macrophages. AKT1 (protein kinase B), a key Jak2/STAT5 pathway protein, plays an important role in regulating the differentiation, secretion, survival and proliferation of mammary epithelial cells, and also plays a key role in mammary remodeling and lactation sustainability in dairy cows (MAROULAKOU et al. 2008; CHEN et al. 2010; CREAMER et al. 2010; ARRANZ et al. 2012; HOU et al. 2016). It is therefore expected to play a key role in mediating the immune response to mastitis in dairy cattle.

Figure 6

Candidate gene interaction network diagram based on the KEGG database. The network map was constructed with the three candidate genes as the core. Interactions between genes are represented by lines.

Discussion

Genetic analysis by GWAS has had a considerable impact on the study of complex genetics (LOH et al. 2015). GWAS has also achieved unprecedented success in identifying gene regions and candidate gene variants closely related to clinical phenotypes and disease susceptibility (at the chromosome and gene level, in terms of the association between SNPs and traits) (YANG et al. 2011; LEE et al. 2012; CROSS-DISORDER GROUP OF THE PSYCHIATRIC GENOMICS et al. 2013). By comparing multiple gene regions or candidate genes and identifying new candidate genes in causal pathways, GWAS opens up new functional studies and suggests therapeutic strategies (DEELEN et al. 2013). Moreover, identifying associated gene mutations can help reveal the pathogenesis of disease and provide entry points for treatment, and analysis of common genetic variation has identified many risk loci for multiple complex diseases (XU et al. 2012; BERNDT et al. 2013). However, knowledge of disease biology and treatment remains limited. The functional changes in genes caused by mutations associated with cow mastitis are also subtle and difficult to explain.

Two-stage correlation analysis for three mastitis significant SNPs

Genomic prediction methods for genetic values may give different results for different phenotypes, and results may differ because genetic structure varies among traits (MOSER et al. 2009; RESENDE et al. 2012). To improve accuracy, we used two-stage association analysis to reduce false positives. We reduced the dimensionality of the data by considering only the SNPs associated with inflammation and immune response. In stage I, because the case-control status in the present study was not normally distributed, and to obtain accurate information on mastitis-associated SNPs and genes in Chinese Holstein cows, we used two analysis models (a Bayesian model and a logistic regression model) to carry out GWAS on the 2b-RAD sequencing results. Several analyses of the genetic background of mastitis traits have been reported in dairy cattle populations, although to our knowledge no study to date has applied two GWAS analysis models at the same time in a Chinese Holstein dairy population. Comparison of the two models showed that, although the SNPs tagged under the same P-value threshold (P<0.05) differed, the general trend of association with mastitis was similar. The results suggested that the Bayesian model screened out more accurate significant SNPs (42, P<0.001, Table S4) in dairy cows, while logistic regression analysis identified more SNPs (51, P<0.01, Table S5). Importantly, we identified three novel SNPs (rs75762330, rs88640083 and rs20438858) significantly associated with mastitis traits in Chinese Holstein cows. SNP rs75762330, within PTK2B, and SNP rs88640083, in the intergenic region near SYK, both located on BTA 8, were risk factors (OR>1), whereas SNP rs20438858 (OR<1), in TNFRSF21 on BTA 23, was a protective factor for mastitis in dairy cows.
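The stage I screening rule described above, which keeps only SNPs that pass the threshold in both models, can be sketched as a simple set intersection. This is a minimal illustration rather than the analysis code used in this study: the function name `shared_significant`, the dictionary input format and the SNP labels in the usage example are hypothetical, while the two thresholds (P<0.001 for the Bayesian model, P<0.01 for logistic regression) are the ones reported here.

```python
def shared_significant(bayes_p, logit_p, bayes_alpha=0.001, logit_alpha=0.01):
    """Return SNP IDs that are significant in both association models.

    bayes_p, logit_p: dicts mapping SNP ID -> P-value from each model
                      (hypothetical input format for this sketch)
    bayes_alpha:      threshold for the Bayesian model (P<0.001 in the text)
    logit_alpha:      threshold for logistic regression (P<0.01 in the text)
    """
    bayes_hits = {snp for snp, p in bayes_p.items() if p < bayes_alpha}
    logit_hits = {snp for snp, p in logit_p.items() if p < logit_alpha}
    # Keep only SNPs flagged by both models, in a stable order.
    return sorted(bayes_hits & logit_hits)


# Usage with made-up P-values (illustrative labels, not SNPs from this study).
bayes = {"snp_A": 0.0002, "snp_B": 0.0005, "snp_C": 0.02}
logit = {"snp_A": 0.004, "snp_B": 0.03, "snp_C": 0.001}
print(shared_significant(bayes, logit))
```

Requiring agreement between two models trades some sensitivity for fewer false positives, which matches the stated aim of the two-stage design.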

With regard to stage II, we used a case-control study to verify the association of the three important SNP markers with cow mastitis. This design compares the exposure ratios of the important SNPs in the case and control groups (BAGHERI et al. 2016; WEISSBROD et al. 2018; ZHOU et al. 2018). If a statistical test shows a significant difference between the two groups, the SNPs can be considered associated with cow mastitis. When comparing the two groups, we excluded interference from external matching factors and considered only the relationship between SNPs and mastitis. We determined an appropriate sample size according to the Pitman efficiency increment formula (2R/(R+1)) and thereby gained higher test efficiency. Here, we validated the association of the three important SNPs with cow mastitis. SNPs rs75762330 and rs88640083 were positive factors (AFe>0) for cow mastitis and were associated with mastitis susceptibility. SNP rs20438858 was a negative regulator (AFe<0) of cow mastitis and was associated with mastitis resistance.
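To make the case-control comparison above concrete, the following sketch computes an allelic odds ratio with a Woolf-type 95% confidence interval from a 2×2 table of allele counts, together with the Pitman efficiency 2R/(R+1). It is an illustration under stated assumptions, not the procedure used in this study: the function names and the counts in the usage example are hypothetical, the log-scale confidence interval is one common choice among several, and R is assumed to denote the number of controls per case.

```python
import math


def allelic_odds_ratio(case_alt, case_ref, ctrl_alt, ctrl_ref, z=1.96):
    """Allelic odds ratio with a Woolf (log-scale) confidence interval.

    Arguments are allele counts (not animal counts) in cases and controls.
    All four counts must be positive. Returns (odds_ratio, lower, upper).
    """
    odds_ratio = (case_alt * ctrl_ref) / (case_ref * ctrl_alt)
    # Standard error of log(OR) from the four cell counts.
    se = math.sqrt(1 / case_alt + 1 / case_ref + 1 / ctrl_alt + 1 / ctrl_ref)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


def pitman_efficiency(r):
    """Pitman efficiency 2R/(R+1), assuming R is the number of controls per case."""
    return 2 * r / (r + 1)


# Usage with made-up allele counts (not data from this study).
or_value, l95, u95 = allelic_odds_ratio(40, 60, 20, 80)
print(or_value, l95, u95)
print(pitman_efficiency(4))
```

An OR above 1 with an interval that excludes 1 points to a risk allele, and an OR below 1 to a protective allele, which is how the susceptibility and resistance labels are assigned to the three SNPs.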

Three significant SNPs are located in genomic non-coding sequences

Previous studies found that conserved non-coding regions (CNCs) in introns and near genes show large allele frequency shifts, similar in magnitude to missense variants, suggesting that CNCs are critical for the regulation of gene function and for evolution in many species, including yeast, fruit flies and vertebrates (HAUDRY et al. 2013; VISSER et al. 2014; PETIBON et al. 2016; DICKEL et al. 2018). Although CNC variation does not directly change the amino acid sequence, it is key to regulating the expression of genetic information and affects biological functions and diseases in mammals (PATRUSHEV and KOVALENKO 2014). Our GWAS data provided a statistical list of SNPs associated with mastitis traits in dairy cows, in which the significant SNPs are located in non-coding regions (intronic and intergenic) of the genome. Functional annotation showed that the three SNPs (rs75762330, rs88640083 and rs20438858) were associated with immune and inflammatory responses in dairy cows, implicating them as key SNPs for mastitis in dairy cows. However, the biological function behind this statistical association is still unknown: the association may stem from interference with another biological function, such as a regulatory function, or from the influence of other functional SNPs, and this can only be clarified by subsequent experimental studies.

Significant SNPs are at low or moderate genetic polymorphisms

Pathogen-specific mastitis traits are a direct indicator of mastitis infection in cows. GWAS have shown that mastitis is a low-heritability polygenic trait controlled by multiple loci distributed across the genome, each with a relatively small genetic effect (WU et al. 2015). Our results were broadly consistent with previous studies. Stage I data showed that rs88640083 (PIC=0.1676<0.25) had low polymorphism, while rs75762330 (0.25<PIC=0.2999<0.5) and rs20438858 (0.25<PIC=0.3366<0.5) were moderately polymorphic. The association of three SNPs with mastitis is also consistent with multiple genetic effects on cow mastitis.
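For reference, the polymorphism classes used above can be reproduced with a short sketch. It assumes the standard PIC formula for a biallelic marker, PIC = 1 − (p² + q²) − 2p²q², and the cut-offs quoted in the text (below 0.25 low, 0.25 to 0.5 moderate); the function names are hypothetical, and the paper does not state which software computed the reported values.

```python
def expected_heterozygosity(p):
    """Expected heterozygosity (He) of a biallelic SNP with allele frequency p."""
    return 2 * p * (1 - p)


def pic_biallelic(p):
    """Polymorphism Information Content of a biallelic SNP.

    Assumes the standard formula PIC = 1 - (p^2 + q^2) - 2 p^2 q^2, with q = 1 - p.
    """
    q = 1 - p
    return 1 - (p * p + q * q) - 2 * p * p * q * q


def classify_pic(pic):
    """Classify a PIC value using the 0.25 and 0.5 cut-offs quoted in the text."""
    if pic < 0.25:
        return "low"
    if pic < 0.5:
        return "moderate"
    return "high"


# The three stage I PIC values reported in the text.
for snp, pic in [("rs88640083", 0.1676), ("rs75762330", 0.2999), ("rs20438858", 0.3366)]:
    print(snp, classify_pic(pic))
```

Note that a biallelic marker reaches its maximum PIC of 0.375 at p = 0.5, so under these cut-offs a single SNP can only be classified as low or moderately polymorphic.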

Three important candidate genes biological function

The innate immune system is a key protective mechanism of the bovine mammary gland against infection by exogenous pathogens. GO function analysis annotated the three significant SNPs to three important genes (PTK2B, SYK and TNFRSF21), suggesting that these genes are novel candidate genes associated with mastitis traits in Chinese Holstein cows. PTK2B is involved in regulating the LPS-TLR4 cascade in macrophages and affects the migration of dendritic cells (DCs) (RACIOPPI et al. 2012; RHEE et al. 2014). It is also an important homeostasis regulator in innate immune cells such as bone marrow mononuclear cells (RACIOPPI et al. 2012; RHEE et al. 2014; LLEWELLYN et al. 2017). SYK plays an essential role in signal transduction from adaptive immune receptors and participates in the regulation of innate immune recognition, vascular development, platelet activation and cell adhesion (MOCSAI et al. 2010). SYK has also been reported to regulate the proliferation of dairy cow mammary epithelial cells and to affect milking cycles and milk production (HOU et al. 2016). TNFRSF21 might play an important role in regulating the degeneration of the mammary gland and in providing protection against infection (KHALIL et al. 2011). We also noted that SYK and TNFRSF21 are involved in the “Toll-like” and “TNF/TNFR” signaling pathways, respectively, which are key pathways for recognizing exogenous pathogens and inducing inflammation and immune responses.

Conclusions

In this study, we aimed to improve understanding of the genetic variation underlying mastitis in Chinese Holstein cows, to guide the construction of anti-mastitis populations and to improve the anti-mastitis characteristics of dairy cow populations. To this end, reduced-representation sequencing (2b-RAD) was used to systematically study conventional genetic variation (direct genotyping) in Chinese Holstein cows, and two-stage association analysis was then used to find significant SNPs associated with risk of mastitis. Finally, we screened out three significant SNPs (rs75762330, rs88640083 and rs20438858) associated with immune response and inflammation, which suggests that the three genes (PTK2B, SYK and TNFRSF21) are novel candidate genes associated with mastitis traits in Chinese Holstein cows.

Abbreviations

2b-RAD, type IIB endonucleases restriction-site associated DNA; GWAS, genome-wide association studies; SNPs, single nucleotide polymorphisms; PTK2B, protein tyrosine kinase 2 beta; SYK, spleen tyrosine kinase; TNFRSF21, tumor necrosis factor receptor superfamily member 21; TLR4, Toll-like receptor 4; NF-κB, nuclear factor-kappa B; PCA, principal component analysis; PC, principal component; Q-Q plots, Quantile-Quantile plots; OR, estimated odds ratio; PIC, Polymorphism Information Content; He, Heterozygosity Expectation; Ho, Heterozygosity Observation.

Ethics approval and consent to participate

The experimental animals and their care protocol followed previous studies. National and local animal welfare agencies (Jiangsu province, China; Nanjing Agricultural University and Nanjing Weigang Dairy Co., Ltd; Approval No.20160615) approved all experimental animal procedures in this study.

Consent for publication

All authors agree to the publication of this manuscript.

Availability of data and materials

The entire genome reference sequence raw data is based on the sequence provided by the NCBI database (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/055/GCF_000003055.6_Bos_taurus_UMD_3.1.1/GCF_000003055.6_Bos_taurus_UMD_3.1.1_genomic.fna.gz).

Summary information and download links for all SNPs (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/055/GCF_000003055.6_Bos_taurus_UMD_3.1.1/GCF_000003055.6_Bos_taurus_UMD_3.1.1_genomic.gff.gz).

Supplementary figures and tables provide supporting data for the manuscript.

Competing interests

The authors declare that they have no competing interests.

Funding

This study was supported by the Agricultural Innovation Fund of Jiangsu Province [grant no. CX(17)1005]; the National Natural Science Foundation of China (grant no. 31372207); the Innovation Team of Scientific Research Platform in Anhui Province; a start-up grant from Nanjing Agricultural University (grant no. 804090); and the “Sanxin” Research Program of Jiangsu Province (grant no. SXG [2016]312).

Authors’ contributions

YC, JL, HL and GW co-designed the experiment. FC, LL, CL, DY, JC, CX and JL participated in the screening of experimental animals and blood sample extraction. FY and FC sorted and analyzed the 2b-RAD sequencing data. FY and TB wrote and revised the manuscript. All authors read and approved the final manuscript.

Table 1 Significant SNPs screened by the Bayesian model and the logistic regression model

Note: * indicates the P-value calculated by the Chi-square test (<0.001); ** is the t-statistic P-value of the logistic regression model (<0.01);

CHISQ is Chi-square under Chi-square test. STAT is the t-statistic coefficient under the logistic regression model.

OR: Estimated odds ratio. L95: Lower bound of 95% confidence interval for odds ratio. U95: Upper bound of 95% confidence interval for odds ratio. Nan: meaningless number. Na: missing value or Not available.

Table 2 Genetic diversity of significant SNPs and their GO enrichment annotations

Note: He: Heterozygosity Expectation. Ho: Heterozygosity Observation. PIC: Polymorphism Information Content.

Table 3 GO enrichment items and genes of three significant SNPs.

Table 4 Case-control study analyzed three significant SNPs in independent validation population.

Acknowledgements

We thank Nanjing Weigang Dairy Co., Ltd. for providing experimental Chinese Holstein blood samples and Shanghai Oe Biotech Co., Ltd. for providing 2b-RAD genome sequencing technology support.

Footnotes

Co-first authors: Fan Yang: yangfanbridge{at}ahnu.edu.cn; Fanghui Chen: 2017205005{at}njau.edu.cn
Other authors: Lili Li: lily219_lee{at}163.com; Li Yan: yanli1995{at}126.com; Tarig Badri: 2014205028{at}njau.edu.cn; Chenglong Lv: 2698145037{at}qq.com; Daolun Yu: 13965466460{at}126.cm; Jie Chen: chenjie1006{at}foxmail.com; Chaofeng Xing: 1820491215{at}qq.com; Jie Li: 1660700720{at}qq.com; Genlin Wang: glwang{at}njau.edu.cn; Honglin Li: HLI{at}Augusta.edu.

References

Arranz, A., C. Doxaki, E. Vergadi, Y. Martinez de la Torre, K. Vaporidi et al., 2012 Akt1 and Akt2 protein kinases differentially contribute to macrophage polarization. Proc Natl Acad Sci U S A 109: 9517–9522.
Bagheri, M., M. Moradi-Sharhrbabak, R. Miraie-Ashtiani, M. Safdari-Shahroudi and R. Abdollahi-Arpanahi, 2016 Case-control approach application for finding a relationship between candidate genes and clinical mastitis in Holstein dairy cattle. J Appl Genet 57: 107–112.
Berndt, S. I., S. Gustafsson, R. Magi, A. Ganna, E. Wheeler et al., 2013 Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture. Nat Genet 45: 501–512.
Bhattarai, D., X. Chen, Z. Ur Rehman, X. Hao, F. Ullah et al., 2017 Association of MAP4K4 gene single nucleotide polymorphism with mastitis and milk traits in Chinese Holstein cattle. J Dairy Res 84: 76–79.
Biscarini, F., H. Schwarzenbacher, H. Pausch, E. L. Nicolazzi, Y. Pirola et al., 2016 Use of SNP genotypes to identify carriers of harmful recessive mutations in cattle populations. BMC Genomics 17: 857.
Brondum, R. F., G. Su, L. Janss, G. Sahana, B. Guldbrandtsen et al., 2015 Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction. J Dairy Sci 98: 4107–4116.
Chen, C. C., R. B. Boxer, D. B. Stairs, C. P. Portocarrero, R. H. Horton et al., 2010 Akt is required for Stat5 activation and mammary differentiation. Breast Cancer Res 12: R72.
Choi, S. H., A. Gonen, C. J. Diehl, J. Kim, F. Almazan et al., 2015 SYK regulates macrophage MHC-II expression via activation of autophagy in response to oxidized LDL. Autophagy 11: 785–795.
Creamer, B. A., K. Sakamoto, J. W. Schmidt, A. A. Triplett, R. Moriggl et al., 2010 Stat5 promotes survival of mammary epithelial cells through transcriptional activation of a distinct promoter in Akt1. Mol Cell Biol 30: 2957–2970.
Crispim, A. C., M. J. Kelly, S. E. Guimaraes, F. Fonseca e Silva, M. R. Fortes et al., 2015 Multi-Trait GWAS and New Candidate Genes Annotation for Growth Curve Parameters in Brahman Cattle. PLoS One 10: e0139906.
Cross-Disorder Group of the Psychiatric Genomics, C., S. H. Lee, S. Ripke, B. M. Neale, S. V. Faraone et al., 2013 Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs. Nat Genet 45: 984–994.
Daetwyler, H. D., A. Capitan, H. Pausch, P. Stothard, R. van Binsbergen et al., 2014 Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle. Nat Genet 46: 858–865.
Deelen, J., H. W. Uh, R. Monajemi, D. van Heemst, P. E. Thijssen et al., 2013 Gene set analysis of GWAS data for human longevity highlights the relevance of the insulin/IGF-1 signaling and telomere maintenance pathways. Age (Dordr) 35: 235–249.
Dickel, D. E., A. R. Ypsilanti, R. Pla, Y. Zhu, I. Barozzi et al., 2018 Ultraconserved Enhancers Are Required for Normal Development. Cell 172: 491–499 e415.
Fujikura, D., M. Ikesue, T. Endo, S. Chiba, H. Higashi et al., 2017 Death receptor 6 contributes to autoimmunity in lupus-prone mice. Nat Commun 8: 13957.
Gianola, D., 2013 Priors in whole-genome regression: the bayesian alphabet returns. Genetics 194: 573–596.
Guo, P., B. Zhu, H. Niu, Z. Wang, Y. Liang et al., 2018 Fast genomic prediction of breeding values using parallel Markov chain Monte Carlo with convergence diagnosis. BMC Bioinformatics 19: 3.
Guo, Y., H. Yuan, D. Fang, L. Song, Y. Liu et al., 2014 An improved 2b-RAD approach (I2b-RAD) offering genotyping tested by a rice (Oryza sativa L.) F2 population. BMC Genomics 15: 956.
Halasa, T., K. Huijps, O. Osteras and H. Hogeveen, 2007 Economic effects of bovine mastitis and mastitis management: a review. Vet Q 29: 18–31.
Halasa, T., M. Nielen, A. P. De Roos, R. Van Hoorne, G. de Jong et al., 2009 Production loss due to new subclinical mastitis in Dutch dairy cows estimated with a test-day model. J Dairy Sci 92: 599–606.
Haudry, A., A. E. Platts, E. Vello, D. R. Hoen, M. Leclercq et al., 2013 An atlas of over 90,000 conserved noncoding sequences provides insight into crucifer regulatory regions. Nat Genet 45: 891–898.
Hernandez-Castro, L. E., M. Paterno, A. G. Villacis, B. Andersson, J. A. Costales et al., 2017 2b-RAD genotyping for population genomic studies of Chagas disease vectors: Rhodnius ecuadoriensis in Ecuador. PLoS Negl Trop Dis 11: e0005710.
Hertl, J. A., Y. H. Schukken, F. L. Welcome, L. W. Tauer and Y. T. Grohn, 2014 Pathogen-specific effects on milk yield in repeated clinical mastitis episodes in Holstein dairy cows. J Dairy Sci 97: 1465–1480.
Hindorff, L. A., P. Sethupathy, H. A. Junkins, E. M. Ramos, J. P. Mehta et al., 2009 Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A 106: 9362–9367.
Hogeveen, H., K. Huijps and T. J. Lam, 2011 Economic aspects of mastitis: new developments. N Z Vet J 59: 16–23.
Hou, X., L. Lin, W. Xing, Y. Yang, X. Duan et al., 2016 Spleen tyrosine kinase regulates mammary epithelial cell proliferation in mammary glands of dairy cows. J Dairy Sci 99: 3858–3868.
Jiao, W., X. Fu, J. Dou, H. Li, H. Su et al., 2014 High-resolution linkage and quantitative trait locus mapping aided by genome survey sequencing: building up an integrative genomic framework for a bivalve mollusc. DNA Res 21: 85–101.
Kadri, N. K., B. Guldbrandtsen, M. S. Lund and G. Sahana, 2015 Genetic dissection of milk yield traits and mastitis resistance quantitative trait loci on chromosome 20 in dairy cattle. J Dairy Sci 98: 9015–9025.
Khalil, E., M. R. Digby, P. C. Thomson, C. Lefevre, S. L. Mailer et al., 2011 Acute involution in the tammar wallaby: identification of genes and putative novel milk proteins implicated in mammary gland function. Genomics 97: 372–378.
Kiku, Y., T. Ozawa, H. Takahashi, S. Kushibiki, S. Inumaru et al., 2017 Effect of intramammary infusion of recombinant bovine GM-CSF and IL-8 on CMT score, somatic cell count, and milk mononuclear cell populations in Holstein cows with Staphylococcus aureus subclinical mastitis. Vet Res Commun 41: 175–182.
Kremer, A. N., J. C. van der Griendt, E. D. van der Meijden, M. W. Honders, B. Ayoglu et al., 2014 Development of a coordinated allo T cell and auto B cell response against autosomal PTK2B after allogeneic hematopoietic stem cell transplantation. Haematologica 99: 365–369.
Lee, S. H., T. R. DeCandia, S. Ripke, J. Yang, C. Schizophrenia Psychiatric Genome-Wide Association Study et al., 2012 Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs. Nat Genet 44: 247–250.
Li, Z., J. Chen, H. Yu, L. He, Y. Xu et al., 2017 Genome-wide association analysis identifies 30 new susceptibility loci for schizophrenia. Nat Genet 49: 1576–1583.
Llewellyn, R. A., K. S. Thomas, M. F. Gutknecht and A. H. Bouton, 2017 The nonreceptor protein tyrosine kinase Pyk2 promotes the turnover of monocytes at steady state. J Leukoc Biol 102: 1069–1080.
Locksley, R. M., N. Killeen and M. J. Lenardo, 2001 The TNF and TNF receptor superfamilies: integrating mammalian biology. Cell 104: 487–501.
Loh, P. R., G. Bhatia, A. Gusev, H. K. Finucane, B. K. Bulik-Sullivan et al., 2015 Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis. Nat Genet 47: 1385–1392.
Luo, X., X. Shi, C. Yuan, M. Ai, C. Ge et al., 2017 Genome-wide SNP analysis using 2b-RAD sequencing identifies the candidate genes putatively associated with resistance to ivermectin in Haemonchus contortus. Parasit Vectors 10: 31.
Maroulakou, I. G., W. Oemler, S. P. Naber, I. Klebba, C. Kuperwasser et al., 2008 Distinct roles of the three Akt isoforms in lactogenic differentiation and involution. J Cell Physiol 217: 468–477.
Maunsell, F. P., D. E. Morin, P. D. Constable, W. L. Hurley, G. C. McCoy et al., 1998 Effects of mastitis on the volume and composition of colostrum produced by Holstein cows. J Dairy Sci 81: 1291–1299.
Meredith, B. K., F. J. Kearney, E. K. Finlay, D. G. Bradley, A. G. Fahey et al., 2012 Genome-wide associations for milk production and somatic cell score in Holstein-Friesian cattle in Ireland. BMC Genet 13: 21.
Meuwissen, T. H., B. J. Hayes and M. E. Goddard, 2001 Prediction of total genetic value using genome-wide dense marker maps. Genetics 157: 1819–1829.
Mocsai, A., J. Ruland and V. L. Tybulewicz, 2010 The SYK tyrosine kinase: a crucial player in diverse biological functions. Nat Rev Immunol 10: 387–402.
Moosavi, M., A. Mirzaei, M. Ghavami and A. Tamadon, 2014 Relationship between season, lactation number and incidence of clinical mastitis in different stages of lactation in a Holstein dairy farm. Vet Res Forum 5: 13–19.
Moser, G., B. Tier, R. E. Crump, M. S. Khatkar and H. W. Raadsma, 2009 A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers. Genet Sel Evol 41: 56.
Patrushev, L. I., and T. F. Kovalenko, 2014 Functions of noncoding sequences in mammalian genomes. Biochemistry (Mosc) 79: 1442–1469.
Pauletto, M., L. Carraro, M. Babbucci, R. Lucchini, L. Bargelloni et al., 2016 Extending RAD tag analysis to microbial ecology: a comparison between MultiLocus Sequence Typing and 2b-RAD to investigate Listeria monocytogenes genetic structure. Mol Ecol Resour 16: 823–835.
Petibon, C., J. Parenteau, M. Catala and S. A. Elela, 2016 Introns regulate the production of ribosomal proteins by modulating splicing of duplicated ribosomal protein genes. Nucleic Acids Res 44: 3878–3891.
Pokorska, J., M. Dusza, D. Kulaj, K. Zukowski and J. Makulska, 2016 Single nucleotide polymorphisms in the CXCR1 gene and its association with clinical mastitis incidence in Polish Holstein-Friesian cows. Genet Mol Res 15.
Racioppi, L., P. K. Noeldner, F. Lin, S. Arvai and A. R. Means, 2012 Calcium/calmodulin-dependent protein kinase kinase 2 regulates macrophage-mediated inflammatory responses. J Biol Chem 287: 11579–11591.
Resende, M. F., Jr., P. Munoz, M. D. Resende, D. J. Garrick, R. L. Fernando et al., 2012 Accuracy of genomic selection methods in a standard data set of loblolly pine (Pinus taeda L.). Genetics 190: 1503–1510.
Rhee, I., M. C. Zhong, B. Reizis, C. Cheong and A. Veillette, 2014 Control of dendritic cell migration, T cell-dependent immunity, and autoimmunity by protein tyrosine phosphatase PTPN12 expressed in dendritic cells. Mol Cell Biol 34: 888–899.
Rubin, C. J., M. C. Zody, J. Eriksson, J. R. Meadows, E. Sherwood et al., 2010 Whole-genome resequencing reveals loci under selection during chicken domestication. Nature 464: 587–591.
Sahana, G., B. Guldbrandtsen, B. Thomsen, L. E. Holm, F. Panitz et al., 2014 Genome-wide association study using high-density single nucleotide polymorphism arrays and whole-genome sequences for clinical mastitis traits in dairy cattle. J Dairy Sci 97: 7258–7275.
Saowaphak, P., M. Duangjinda, S. Plaengkaeo, R. Suwannasing and W. Boonkum, 2017 Genetic correlation and genome-wide association study (GWAS) of the length of productive life, days open, and 305-days milk yield in crossbred Holstein dairy cattle. Genet Mol Res 16.
Schukken, Y. H., J. Hertl, D. Bar, G. J. Bennett, R. N. Gonzalez et al., 2009 Effects of repeated gram-positive and gram-negative clinical mastitis episodes on milk yield loss in Holstein dairy cows. J Dairy Sci 92: 3091–3105.
Schweighoffer, E., J. Nys, L. Vanes, N. Smithers and V. L. J. Tybulewicz, 2017 TLR4 signals in B lymphocytes are transduced via the B cell antigen receptor and SYK. J Exp Med 214: 1269–1280.
Selimovic-Hamza, S., C. L. Boujon, M. Hilbe, A. Oevermann and T. Seuberlich, 2017 Frequency and Pathological Phenotype of Bovine Astrovirus CH13/NeuroS1 Infection in Neurologically-Diseased Cattle: Towards Assessment of Causality. Viruses 9.
Sodeland, M., M. P. Kent, H. G. Olsen, M. A. Opsal, M. Svendsen et al., 2011 Quantitative trait loci for clinical mastitis on chromosomes 2, 6, 14 and 20 in Norwegian Red cattle. Anim Genet 42: 457–465.
Strilic, B., L. Yang, J. Albarran-Juarez, L. Wachsmuth, K. Han et al., 2016 Tumour-cell-induced endothelial cell necroptosis via death receptor 6 promotes metastasis. Nature 536: 215–218.
Swinkels, J. M., H. Hogeveen and R. N. Zadoks, 2005 A partial budget model to estimate economic benefits of lactational treatment of subclinical Staphylococcus aureus mastitis. J Dairy Sci 88: 4273–4287.
Tiezzi, F., K. L. Parker-Gaddis, J. B. Cole, J. S. Clay and C. Maltecca, 2015 A genome-wide association study for clinical mastitis in first parity US Holstein cows using single-step approach and genomic matrix re-weighting procedure. PLoS One 10: e0114919.
Usman, T., Y. Wang, C. Liu, X. Wang, Y. Zhang et al., 2015 Association study of single nucleotide polymorphisms in JAK2 and STAT5B genes and their differential mRNA expression with mastitis susceptibility in Chinese Holstein cattle. Anim Genet 46: 371–380.
Varshney, R. K., R. K. Saxena, H. D. Upadhyaya, A. W. Khan, Y. Yu et al., 2017 Whole-genome resequencing of 292 pigeonpea accessions identifies genomic regions associated with domestication and agronomic traits. Nat Genet 49: 1082–1088.
Visscher, P. M., N. R. Wray, Q. Zhang, P. Sklar, M. I. McCarthy et al., 2017 10 Years of GWAS Discovery: Biology, Function, and Translation. Am J Hum Genet 101: 5–22.
Visser, M., R. J. Palstra and M. Kayser, 2014 Human skin color is influenced by an intergenic DNA polymorphism regulating transcription of the nearby BNC2 pigmentation gene. Hum Mol Genet 23: 5750–5762.
Wang, S., P. Liu, J. Lv, Y. Li, T. Cheng et al., 2016a Serial sequencing of isolength RAD tags for cost-efficient genome-wide profiling of genetic and epigenetic variations. Nat Protoc 11: 2189–2200.
Wang, S., E. Meyer, J. K. McKay and M. V. Matz, 2012 2b-RAD: a simple and flexible method for genome-wide genotyping. Nat Methods 9: 808–810.
Wang, S., Y. Zhang, W. Dai, K. Lauter, M. Kim et al., 2016b HEALER: homomorphic computation of ExAct Logistic rEgRession for secure rare disease variants analysis in GWAS. Bioinformatics 32: 211–218.
Wang, X., P. Ma, J. Liu, Q. Zhang, Y. Zhang et al., 2015 Genome-wide association study in Chinese Holstein cows reveal two candidate genes for somatic cell score as an indicator for mastitis susceptibility. BMC Genet 16: 111.
Weissbrod, O., J. Flint and S. Rosset, 2018 Estimating SNP-Based Heritability and Genetic Correlation in Case-Control Studies Directly and with Summary Statistics. Am J Hum Genet 103: 89–99.
Welderufael, B. G., L. L. G. Janss, D. J. de Koning, L. P. Sorensen, P. Lovendahl et al., 2017 Bivariate threshold models for genetic evaluation of susceptibility to and ability to recover from mastitis in Danish Holstein cows. J Dairy Sci 100: 4706–4720.
Wellcome Trust Case Control, C., J. B. Maller, G. McVean, J. Byrnes, D. Vukcevic et al., 2012 Bayesian refinement of association signals for 14 loci in 3 common diseases. Nat Genet 44: 1294–1301.
Wiggans, G. R., T. S. Sonstegard, P. M. VanRaden, L. K. Matukumalli, R. D. Schnabel et al., 2009 Selection of single-nucleotide polymorphisms and quality of genotypes used in genomic evaluation of dairy cattle in the United States and Canada. J Dairy Sci 92: 3431–3436.
Wijga, S., J. W. Bastiaansen, E. Wall, E. Strandberg, Y. de Haas et al., 2012 Genomic associations with somatic cell score in first-lactation Holstein cows. J Dairy Sci 95: 899–908.
Wu, X., M. S. Lund, G. Sahana, B. Guldbrandtsen, D. Sun et al., 2015 Association analysis for udder health based on SNP-panel and sequence data in Danish Holsteins. Genet Sel Evol 47: 50.
Xu, X., Y. Hou, X. Yin, L. Bao, A. Tang et al., 2012 Single-cell exome sequencing reveals single-nucleotide mutation characteristics of a kidney tumor. Cell 148: 886–895.
Yang, J., S. H. Lee, M. E. Goddard and P. M. Visscher, 2011 GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 88: 76–82.
Zhang, Q., B. Guldbrandtsen, J. R. Thomasen, M. S. Lund and G. Sahana, 2016 Genome-wide association study for longevity with whole-genome sequencing in 3 cattle breeds. J Dairy Sci 99: 7289–7298.
Zhao, Y., K. Su, G. Wang, L. Zhang, J. Zhang et al., 2017 High-Density Genetic Linkage Map Construction and Quantitative Trait Locus Mapping for Hawthorn (Crataegus pinnatifida Bunge). Sci Rep 7: 5492.
Zhou, W., J. B. Nielsen, L. G. Fritsche, R. Dey, M. E. Gabrielsen et al., 2018 Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat Genet 50: 1335–1341.

Posted October 03, 2018.

Subject Area

Genomics

