Introduction
In the pork industry, fat deposition traits such as backfat thickness and fat percentage are considered of major economic importance. Research on how genetic variants affect fat deposition could be useful both theoretically and practically. By using candidate gene and quantitative trait loci (QTL) mapping strategies, genes, such as
MC4R (melanocortin-4 receptor) [
1] and
FTO (fat mass and obesity-associated) [
2], were successfully identified as contributing to fat deposition. As a result of low mapping density of microsatellite markers, only a few quantitative trait loci have been recognized as associated with fat deposition [
3,
4]. Due to the rapid development of technology for high throughout sequencing, the current porcine 60K SNP chip provides a higher density than previous chips and can improve precision in locating QTL regions [
5].
A number of economically important traits in pigs have been investigated in genome-wide association (GWA) studies [
6,
7]. However, in most of the GWA studies only one experimental population was used, and the results have not been verified in independent populations. The aims of this study were to detect potential genetic variants in the backfat thickness trait in two independent populations of pigs using GWA study and to investigate how this is affected by genetic background.
Materials and methods
Animals and traits
Two pig populations were used; a population composed of 820 sows from two genetic lines including Large White pigs and Large White × Landrace cross (Population 1) and an F
2 population built by intercrossing Berkshire boars and Yorkshire sows from which 208 F
2 pigs were randomly selected (Population 2). Phenotype assessments of the tenth rib backfat thickness were recorded as previously described [
4,
7], when individual pigs reached a bodyweight of about 130 kg.
SNP chip genotyping and quality control
All pigs were genotyped by Illumina PorcineSNP60 BeadChip, containing 62163 SNPs across the entire genome. The gPLINK software [
8] was used for quality control of genotyped data according to the following criteria: (1) call rate>0.9, (2) minor allele frequency (MAF)>0.01, and (3) significant divergence from Hardy-Weinberg Equilibrium with
P-values>10
-6. For Population 1, a total of 56440 SNPs passed the quality control and were retained in the dataset. For Population 2, 13678 SNPs were removed and 49485 SNPs were kept for further analyzes. In addition, a total of 41635 SNPs were selected for cluster analysis of the two populations.
Statistical analysis
GWA analysis was performed with the Genome Association and Prediction Integrated Tool (GAPIT) using the compressed mixed linear model (CMLM) method [
9]. The CMLM is described as follows:
where Y is the vector of observed phenotypes, X is a matrix representing genotypes, P is the principal component matrix for population structure, K is the kinship matrix. Xα and Pβ are regarded as fixed effects and Kμ and e are regarded as random effects.
Population structure represents non-random distribution of genotypes among individuals within a population, and population structure and unequal relatedness among individuals in a population are the two major causes of false positive results in association studies [
9,
10]. In the CMLM method, population structure is fitted as a fixed effect, whereas kinship among individuals is incorporated as the variance-covariance structure of the random effect for individuals. In addition, all individuals will be clustered into groups to give better control of population structure and kinship so as to reduce the false positive association rate [
9]. SNPs are sorted by respective
P-values and SNPs showing high significance for the trait will have the smallest
P-values. Thirty SNPs with the lowest
P-values were chosen for annotation by Ensembl
Sus scrofa Build 10.2 (http://asia.ensembl.org/Sus_scrofa/Info/Index?db=core). A cluster analysis of populations was performed by principle component analysis (PCA) using GAPIT.
Haplotype block analysis was performed on the genome region containing the significantly associated SNPs in Population 2 and run with the default parameters using HAPLOVIEW program [
11].
Results and discussion
Population stratification is one of the major factors influencing the validity of a GWA study [
10]. The PCA clustering of two populations is shown in Fig. 1. For Population 1, the individuals from two genetic lines were classified into one cluster, suggesting there was no significant genetic difference between these lines. In addition, sows from Population 1 were obviously separated from those from Population 2.
GWA results for the tenth rib backfat thickness trait are shown in Fig. 2. Different associations between two pig populations were evident. The tenth rib backfat thickness trait is not only controlled by multiple genes but can also be affected to varying degrees by other environmental factors. For quantitative traits, it is difficult to confirm QTL or genes that have common effects and the effects, such as fatness and pork quality, could be verified in different pig populations [
12]. Furthermore, the separation of two populations in clustering analysis (Fig. 1) suggested the existence of genetic differences between these populations, which might partly explain the different GWA results.
For Population 1 (Fig. 2a), the 30 most significant SNPs were chosen for annotation and five SNPs were in disassembled scaffolds of
Sus scrofa 10.2. Among the 25 mapped SNPs, the largest number were on SSC1 (9), and the rest were distributed on SSC6 (6), SSC7 (3), SSC9 (2), SSC12 (2), SSC16 (2) and SSCX (1). The location of the SNPs on SSC1 and SSC7 were consistent previous reports [
7,
13], and the SNPs on SSC1 showed the greatest association with the trait. The top four significant SNPs were located on SSC1 (ALGA0006623, ALGA0006599, INRA0004898 and DRGA0001605). Additionally, the most significant SNP (ALGA0006623) was located at around 528 kb upstream of
MC4R gene. MC4R, a G-protein coupled receptor, has been implicated in mediating the effect of leptin on food intake and energy balance, and has previously been found to be associated with fat deposition in pig [
1]. Also,
CCBE1 (collagen and calcium binding EGF domains 1) gene (within gene, ASGA0005017) on SSC1, has been related to fat metabolism through DAVID (http://david.abcc.ncifcrf.gov/) functional annotation [
7]. Two SNPs (H3GA00 20700 and MARC0069646) corresponded to a fat QTL region between TNFB (58 cM) and S0102 (70.1 cM) on SSC7 [
13].
In Population 2 (Fig. 2b), the 27 most significant SNPs were mapped in the
Sus scrofa 10.2, and 12 SNPs (M1GA0016251, MARC0075799, ASGA0053328, ALGA0065206, ALGA0065212, ALGA0065251, MARC0014203, M1GA0016298, ALGA0065308, ASGA0053453, MARC0040976 and ASGA0053456) were located within a 1.78 Mb segment (between 14.14 and 15.92 Mb) on SSC12, which corresponded to a QTL region associated with fat metabolism [
14]. There were eight SNPs located within five annotated genes:
PITPNC1 (phosphatidylinositol transfer protein, cytoplasmic 1),
TEX2 (testis expressed 2),
FTSJ3 (FtsJ homolog 3),
MAP3K3 (mitogen-activated protein kinasekinasekinase 3) and
MARCH10 (membrane-associated ring finger (C3HC4) 10, E3 ubiquitin protein ligase). Two haplotype blocks were identified on SSC12 for these 12 SNPs (Fig. 3). The first haplotype block, M1GA0016251-MARC0075799, was within an 18 kb fragment, located in the
PITPNC1 gene,which has been associated with fatty acid transport or metabolism. PITPNC1, also known as M-rdgB beta, encodes a protein that belongs to the Nir/rdgB family and has been implicated in a broad spectrum of cellular functions, such as regulation of lipid trafficking and metabolism [
14]. The second haplotype block, ALGA0065251-MARC0014203-M1GA0016298-ALGA 0065308, was within a 401 kb fragment, identified around 16 genes. Of them,
GH1 (growth hormone 1) has been reported to be associated with childhood obesity [
15]. GH1 is an endocrine factor secreted from the anterior pituitary gland that serves to synchronize growth and metabolism. GH1 deficiency in mammals has been found to be associated with stunted somatic growth and increased adipose tissue [
15]. Also,
GH1 gene was found to be related to fat deposition in pigs [
12].
Conclusions
GWA studies for the tenth rib backfat thickness trait were undertaken on two pig populations of distinct genetic backgrounds, and the associated genes and genome regions have been identified. This not only confirmed the reported candidate genes but also revealed novel genes. The results differed between the two populations indicating that the genetic mechanisms regulating backfat thickness are complex and GWA results can be affected by the genetic background of the population studied.
Higher Education Press and Springer-Verlag Berlin Heidelberg