A human-specific allelic group of the MHC DRB1 gene in primates

  • Yoshiki Yasukochi1Email author and

    Affiliated with

    • Yoko Satta2

      Affiliated with

      Journal of Physiological AnthropologyAn official journal of the Japan Society of Physiological Anthropology (JSPA)201433:14

      DOI: 10.1186/1880-6805-33-14

      Received: 24 December 2013

      Accepted: 27 May 2014

      Published: 13 June 2014



      Diversity among human leukocyte antigen (HLA) molecules has been maintained by host-pathogen coevolution over a long period of time. Reflecting this diversity, the HLA loci are the most polymorphic in the human genome. One characteristic of HLA diversity is long-term persistence of allelic lineages, which causes trans-species polymorphisms to be shared among closely related species. Modern humans have disseminated across the world after their exodus from Africa, while chimpanzees have remained in Africa since the speciation event between humans and chimpanzees. It is thought that modern humans have recently acquired resistance to novel pathogens outside Africa. In the present study, we investigated HLA alleles that could contribute to this local adaptation in humans and also studied the contribution of natural selection to human evolution by using molecular data.


      Phylogenetic analysis of HLA-DRB1 genes identified two major groups, HLA Groups A and B. Group A formed a monophyletic clade distinct from DRB1 alleles in other Catarrhini, suggesting that Group A is a human-specific allelic group. Our estimates of divergence time suggested that seven HLA-DRB1 Group A allelic lineages in humans have been maintained since before the speciation event between humans and chimpanzees, while chimpanzees possess only one DRB1 allelic lineage (Patr-DRB1*03), which is a sister group to Group A. Experimental data showed that some Group A alleles bound to peptides derived from human-specific pathogens. Of the Group A alleles, three exist at high frequencies in several local populations outside Africa.


      HLA Group A alleles are likely to have been retained in human lineages for a long period of time and have not expanded since the divergence of humans and chimpanzees. On the other hand, most orthologs of HLA Group A alleles may have been lost in the chimpanzee due to differences in selective pressures. The presence of alleles with high frequency outside of Africa suggests these HLA molecules result from the local adaptations of humans. Our study helps elucidate the mechanism by which the human adaptive immune system has coevolved with pathogens over a long period of time.


      Allelic lineage Balancing selection HLA Out-of-Africa Pathogen Trans-species polymorphism



      base pair


      coding sequence

      d N

      number of non-synonymous substitutions per non-synonymous site

      d S

      number of synonymous substitutions per synonymous site

      d Smax

      maximum genetic distance at synonymous sites




      human leukocyte antigen


      human papillomavirus type 11


      influenza B virus


      Immune Epitope Database

      K S

      number of synonymous substitutions

      K Smax

      maximum number of synonymous substitutions


      linkage disequilibrium

      L S

      mean number of synonymous site


      major histocompatibility complex


      maximum likelihood


      million years ago

      N e

      effective population size




      peptide-binding region


      time to most recent common ancestor.


      Modern humans (Homo sapiens) live in a wide variety of environments, ranging from polar to tropical regions. Physiological anthropologists have long addressed the issue of ‘human adaptation’ to a variety of environments (that is the ability of humans to survive in a changing environment). Molecular evolution and population genetics also focus on the adaptation of humans to environmental changes. The approach of physiological anthropology is mainly to investigate differences in physiological modifications among individuals or ethnic groups in various environments (‘physiological polymorphism’) in order to understand human adaptation. On the other hand, molecular evolution or population genetics seek indications of natural selection by comparing nucleotide sequences of a target gene. If a new mutation at a target locus confers advantage for fitness in a certain environment, such a mutation is expected to rapidly spread throughout a population because of positive natural selection. Methods to detect such a signal of natural selection have been developed. For instance, in a protein coding gene, an excess in the number of non-synonymous substitutions (that change the amino acid sequence) over synonymous substitutions (neutral mutation) suggests that positive selection or balancing selection has occurred during the evolution of the target gene. In addition, the relationship between an allelic frequency and the extent of linkage disequilibrium (LD) around the selected mutation help us to find an allele that has rapidly spread in a population [1]. The advantageous allele is expected to dramatically increase its frequency in a short time so that recombination does not substantially break down the LD around the selected site.

      Humans live in various environments around the world. The endemic pathogens that humans are infected by in these areas differ and humans have evolved to deal with these pathogens. In the present study, we focus on polymorphisms in the major histocompatibility complex (MHC), which plays an important role in triggering immune reactions in response to pathogens, and we discuss the possibility that a human-specific MHC allele is involved in the immunological adaptation to a human-specific pathogen.

      The MHC is a set of cell-surface molecules that are responsible for presenting antigens from pathogens to lymphocytes in jawed vertebrates. As such, it is an important genetic system for protection against infectious disease [2]. In humans, the MHC is termed human leukocyte antigen (HLA). The HLA genomic region is located on the short arm of chromosome 6 at 6p21.3, spanning approximately 4 Mbp and comprising 224 genes [3]. The region is classified into three subregions: class I, class II, and class III regions. Among HLA molecules, six class I and II molecules (HLA-A, B, and C of class I and HLA-DR, DQ, and DP of class II) are important for antigen presentation to T lymphocytes. Class I molecules mainly bind to peptides from cytosolic proteins and the HLA-peptide complex is recognized by CD8+ T cells. Class II molecules present extracellular antigens to CD4+ T cells. Class I molecules consist of two polypeptide chains, an α heavy chain encoded in the class I region, and a β2-microglobulin light chain encoded on chromosome 15. Class II molecules are composed of two polypeptide chains, α and β chains, encoded in the class II region. For instance, the DRA and DRB1 genes in the class II region encode the α and β chains, respectively, of the DR molecule. A peptide-binding region (PBR) was characterized with crystallography by Bjorkman et al. [4] for class I HLA-A and by Brown et al. [5] for class II HLA-DR. Molecular evolutionary studies of this region have revealed an enhancement of non-synonymous substitutions in the PBR, suggesting that the PBR is a target for balancing selection, which is responsible for the maintenance of HLA polymorphisms [610].

      Polymorphisms in HLA genes have three unique features: (1) a large number of alleles, (2) a high degree of heterozygosity, and (3) remarkably long persistence time of the allelic lineage. These features are maintained by balancing selection but not by an increased mutation rate [11, 12].

      The chimpanzee (Pan troglodytes) is the closest extant relative of humans. Interestingly, chimpanzees appear to have resistance to several pathogens to which humans are susceptible, including HIV type 1 and human hepatitis B virus [13]. This indicates that the two species differ in their immune responses to these pathogens, and that possibly the pathogen recognition repertoire for MHC is different between the two species. Chimpanzees share some class II DRB1 allelic lineages with humans [1416]. In humans, genetic variation and selective intensity on DRB1 are the greatest in the class II genes [17]. In humans, there are 13 DRB1 allelic lineages (HLA-DRB1*01, *03, *04, *07, *08, *09, *10, *11, *12, *13, *14, *15 and *16), while there are only four allelic lineages (Patr-DRB1*02, *03, *07 and *10) in chimpanzees [1416].

      Chimpanzees have stayed in Africa since their divergence from humans approximately six million years ago (MYA). On the other hand, modern humans have dispersed across the world from Africa from 100,000 to 50,000 years ago and have adapted to regions with various exogenous pathogens. This begs the question of how modern humans have acquired resistance to a variety of pathogens in different environments. Therefore, the present study investigated the evolution of HLA-DRB1 alleles that confer resistance to novel pathogens in humans. For this purpose, we studied nucleotide sequences of HLA genes using the IMGT/HLA database (http://​www.​ebi.​ac.​uk/​imgt/​hla/​, [18]).

      Materials and methods

      Nucleotide sequences of humans, chimpanzees, rhesus monkeys (Macaca mulatta), and crab-eating macaques (Macaca fascicularis) were used for phylogenetic analyses. A dataset of human DRB allele sequences, including DRB1 and other functional DRB (DRB3, DRB4, and DRB5), was obtained from the IMGT/HLA database. The dataset of non-human primate DRB1 alleles was obtained from the IPD MHC NHP database (http://​www.​ebi.​ac.​uk/​ipd/​mhc/​nhp/​, [19]). In the database, there were many partial coding sequences (CDS) (mainly exon 2 sequences). Using incomplete sequences is likely to be misleading in analysis of the phylogenetic relationships among sequences; therefore, we performed phylogenetic analyses only for full-length DRB1 CDS. Because only partial sequences were available, we also excluded sequence data of the gorilla (Gorilla gorilla) and orangutan (Pongo pygmaeus) from the present analysis. We used two HLA-DQB1 alleles as outgroup sequences. Next, we removed sequences of potential recombinant alleles according to a method that assumes a binomial distribution of the ratio of substitutions in a particular region to that in the entire region [17, 2022]. For phylogenetic analyses, we used 104 complete CDS: 56 HLA-DRB1, 6 HLA-DRB3, 4 HLA-DRB4, 2 HLA-DRB5, 11 chimpanzee Patr-DRB1, 22 rhesus monkey Mamu-DRB1, and 3 crab-eating macaque Mafa-DRB1 alleles.

      Brown et al. [5] identified 24 amino acids in the PBR of HLA-DRB1 genes. In addition to the defined PBR, we included three amino acid sites (positions of 57, 67, and 90; for a total of 27 amino acids), because Brown and collaborators have subsequently shown that the three sites are involved in the formation of peptide-binding grooves and peptide binding [23].

      Multiple sequence alignment of nucleotide sequences and phylogenetic-tree construction were performed using the MEGA v5.10 software [24]. A maximum likelihood (ML) tree for the non-PBR region was constructed based on the Hasegawa-Kishino-Yano (HKY) substitution model [25] with the nearest-neighbor-interchange (NNI) ML heuristic search. The best-fit substitution model was estimated by MEGA. Bootstrap analysis was performed using 1,000 replications. The numbers of non-synonymous substitutions per non-synonymous site (dN) and synonymous substitutions per synonymous site (dS) were calculated using the modified Nei-Gojobori method [26] with a Jukes-Cantor correction [27]. The transition/transversion bias used in this calculation was estimated with the ML method in MEGA. The average divergence time of DRB1 alleles was estimated by the average of all pairwise dS values, and the time to the most recent common ancestor (TMRCA) of alleles was estimated from the maximum number of synonymous substitutions per site (dSmax). The divergence time was estimated by the following formula:
      TMRCA = d Smax / 2 μ http://static-content.springer.com/image/art%3A10.1186%2F1880-6805-33-14/MediaObjects/40101_2013_Article_69_Equa_HTML.gif

      where μ is the neutral substitution rate of 10−9 per site per year at the MHC loci [9]. Pathogens recognized by HLA-DRB1 molecules were examined using the Immune Epitope Database (IEDB) (http://​www.​immuneepitope.​org, [28]). Information about HLA-DRB1 allele frequency among different human populations was collected from the NCBI dbMHC database (http://​www.​ncbi.​nlm.​nih.​gov/​gv/​mhc, [29]).

      Results and discussion

      Two phylogenetic groups of HLA-DRB1 alleles and human-specific HLA Group A

      To examine phylogenetic relationships among DRB alleles in four primate species (HLA-DRB1/3/4/5, Patr-DRB1, Mamu-DRB1, and Mafa-DRB1), an ML tree was constructed from nucleotide sequences of the non-PBR region (Figure 1). Nucleotide sequences in the PBR were excluded for construction of the tree because they had an approximately ten-fold higher amino acid-altering (non-synonymous) substitution rate than synonymous substitutions due to balancing selection (Hughes and Nei [6, 7]; Takahata and Nei [11]). When we focused on HLA-DRB1 alleles, we identified two distinct clades in the ML tree. We refer to these two groups as HLA Group A and HLA Group B. Of the 13 known HLA allelic lineages, seven lineages, including DRB1*03, *08, *10, *11, *12, *13, and *14, were assigned to Group A, while the remaining six lineages, DRB1*01, *04, *07, *09, *15, and *16, were assigned to Group B.
      Figure 1

      Maximum likelihood tree for nucleotide sequences (690 bp) in the non-peptide-binding region (PBR) of MHC DRB alleles. The sequence data of MHC DRB alleles, including those of humans, chimpanzees, and macaques, were obtained from IMGT/HLA and IPD databases. HLA-DRB1 alleles are indicated in bold. Arrow indicates the Patr-DRB1*03 lineage, which is a sister group of HLA Group A alleles. Only bootstrap values > 80% are shown. Two HLA-DQB1 sequences were used as an outgroup. The evolutionary distances were computed using the Hasegawa-Kishino-Yano (HKY) model. HLA Group A and HLA Group B indicate two major phylogenetic groups of HLA-DRB1 alleles. HLA, humans; Patr, chimpanzees; Mamu, rhesus monkeys; Mafa, crab-eating macaques.

      In the ML tree, the Group B alleles showed trans-species evolution of polymorphisms with those in the chimpanzee (Patr-DRB1*02 and *07). Interestingly, 31 Group A alleles formed a monophyletic clade distinct from other primate DRB1 alleles, although the bootstrap value for supporting this cluster was not particularly high, suggesting that the Group A alleles are human-specific. Previous studies [1416] have not identified this DRB1 monophyletic group in humans, because the nucleotide sequences used in those studies were limited to exon 2.

      Both the mean and maximal dS values were larger in Group B (mean dS, 0.041; dSmax, 0.082) than in Group A (mean dS, 0.018; dSmax, 0.057) (Table 1). This indicates that most allelic lineages in Group B have been maintained for a longer time than those in Group A. Additionally, Group A alleles may have diverged more recently than Group B alleles. Based on these results, we propose two hypotheses for the monophyly of Group A: (1) Group A alleles specifically expanded in the human lineage or (2) the orthologs to Group A alleles were lost in chimpanzees. We estimated the divergence time for alleles in each group in order to test these hypotheses.
      Table 1

      The divergence time of the two HLA groups, HLA -Group A and HLA -Group B






      Group A


      9 MYA


      29 MYA

      Group B


      21 MYA


      41 MYA

      T indicates the mean divergence time of alleles. TMRCA indicates the time to the most recent common ancestor of alleles.

      Divergence time of alleles in HLA Groups A and B

      The phylogeny showed a difference in divergence time between Groups A and B. The mean divergence times for Groups A and B were approximately 9 and 21 MYA, respectively, and TMRCAs were approximately 29 and 41 MYA, respectively (Table 1). These values suggest the presence of specific trans-species polymorphisms [10, 30, 31] in both groups, because the mean divergence time exceeded the speciation time of humans and chimpanzees [3234]. Based on this result, we rejected the hypothesis that the HLA Group A allelic lineages specifically expanded in humans. However, the tree revealed that alleles in Group A did not intermingle with other non-human primate DRB1 alleles (Figure 1). The closest was the Patr-DRB1*03 lineage cluster (indicated by an arrow in Figure 1).

      Furthermore, we estimated the TMRCA of Patr-DRB1*03 cluster to be 4.6 MYA (Figure 2), suggesting that the alleles in this cluster diverged in chimpanzees after their divergence from humans. Accordingly, only one allelic lineage leading to the cluster in extant chimpanzees existed in the common ancestral population of humans and chimpanzees. On the other hand, in humans, pairwise dS distances between HLA-DRB1 alleles suggested that seven allelic lineages existed in the ancestral population (Figure 2). Therefore, the common ancestral population likely possessed at least eight allelic lineages.
      Figure 2

      Divergence times of HLA Group A and Patr-DRB1*03 alleles. The dashed line represents the speciation event of humans and chimpanzees. Times to most recent common ancestor (TMRCAs) were estimated based on the maximum genetic distance at synonymous sites (dSmax).

      Although alleles in Group A formed a single clade in the ML tree of primate DRB alleles, the TMRCA was 29 MYA, which is significantly older than six MYA (that is the speciation time of humans and chimpanzees). Thus, the molecular clock for DRB1 alleles may have been skewed by various factors, such as back or parallel mutations (multiple mutations) or recombination/gene conversion. Indeed, in the Group A allele sequences, there was segregation of 21 synonymous sites. Among them, ten were singletons with a unique nucleotide observed only once in the sampled alleles, and 11 were phylogenetically informative sites. Among 55 pairs of 11 informative sites, 13 pairs were phylogenetically incompatible with each other. This incompatibility was likely the result of either recombination/gene conversion or multiple mutations at a single site. In the event of recombination/gene conversion, however, double recombination in a relatively small region or a conversion tract with a small size should be considered. Multiple mutations are a more likely cause of this incompatibility. To examine whether the presence of multiple substitutions masked an accurate estimate of the TMRCA, we tested the accuracy of the correction for multiple substitutions in the calculation of dSmax.

      For this purpose, we estimated the maximum number of synonymous substitutions in a different way. First, we placed synonymous substitutions observed in the Group A alleles on each branch of the ML tree parsimoniously (Figure 1 and Additional file 1: Figure S1) and re-counted the number of synonymous substitutions (KS) in each pair of Group A alleles. The maximum KS was thirteen (KSmax = 13). TMRCA was calculated from this KSmax divided by the mean number of synonymous sites (LS = 223). As a result, the TMRCA of the Group A alleles was estimated to be 29 MYA. This showed good agreement with the TMRCA estimated by the Jukes-Cantor correction (29 MYA). Because there was no bias in our method of estimating TMRCA, we considered it to be reliable.

      Probability of maintaining seven human-specific HLA Group A allelic lineages over six million years

      A method for calculating the probability, g nk (t) [35], that there were k allelic lineages among n extant lineages for t in N generations under balancing selection is available. In the present study, we tried to calculate the probability g nk (t) for seven ancestral allelic lineages being maintained since approximately six MYA among a sample of 31 Group A alleles (n = 31). However, because HLA-DRB1 also contains the 25 Group B alleles, the 31 Group A sequences are only a part of the samples in the entire HLA-DRB1. There were no means to determine the effective population size (Ne) of these subpopulations, which was required for the calculation of g nk (t); therefore, we could not calculate the probability of maintaining the current Group A alleles for six million years.

      The effective population size Ne of modern humans is smaller than that of chimpanzees [3638], and the eight allelic lineages in the ancestral population have likely been lost more frequently from the human lineage than the chimpanzee lineage. Nevertheless, the number of allelic lineages in humans is seven times larger than that in chimpanzees. This supports the hypothesis that natural selection selectively maintained Group A alleles in humans. It is important to understand the biological reasons why these seven lineages have been maintained only in humans.

      Specific peptides bound to HLA Group A alleles

      It is possible that the HLA Group A allelic lineages have been because they bind to peptides derived from human-specific pathogens. Thus, we examined pathogens and their specific peptides recognized by each of the Group A and B allelic lineages based on information of experimental data from the IEDB database (Table 2). There were ten pathogens that produced peptides bound only by Group A alleles (for example, human papillomavirus type 11 (HPV-11) and influenza B virus (IBV)), and some of them were candidates for human-specific pathogens. In fact, in addition to HPV-11, Bordetella pertussis and measles viruses have been reported to be human-specific pathogens [39, 40] (Table 2). Moreover, the IBV is restricted to humans with the exception of an infection identified in seals stranded on the Dutch coast [41]. At present, however, the repertoire of peptides bound by each allele is limited in the experimental data. Future studies will establish whether chimpanzees and macaques MHC are able to bind HLA Group A-specific peptides.
      Table 2

      The comparison of specific pathogen bound by HLA-DRB1 molecules between Group A and Group B

      Source organism ID

      Source organism name

      HLA-DRB1 molecule



      Bordetella pertussis


      Group A


      Streptococcus mutans



      Dermatophagoides pteronyssinus



      Hepatitis B virus subtype adw2



      Human papillomavirus type 11



      Measles virus

      HLA-DRB1*11:01, 11:03


      Influenza B virus [B/Hong Kong/330/2001]



      Influenza A virus [A/Bangkok/1/1979(H3N2)]



      Influenza A virus [A/Japan/305/1957(H2N2)]

      HLA-DRB1*11:01, 11:02, 11:03, 11:04


      Streptococcus mutans GS-5



      Helicobacter pylori

      HLA-DRB1*04:01, 15:01

      Group B


      Brucella ovis

      HLA-DRB1*04:01, 04:07


      Escherichia coli

      HLA-DRB1*04:01, 07:01, 15:01


      Streptococcus pneumoniae



      Lactococcus lactis

      HLA-DRB1*04:01, 04:07


      Bacillus subtilis



      Mycobacterium avium






      Vaccinia virus WR



      Herpes simplex virus (type 1/strain SC16)

      HLA-DRB1*04:01, 15:01


      Human rotavirus strain P

      HLA-DRB1*04:01, 04:04, 15:01


      Unidentified influenza virus



      HIV type 1 (CLONE 12)



      HIV virus type 1 (JH3 ISOLATE)

      HLA-DRB1*09:01, 15:01





      Influenza A virus [A/Hong Kong/156/97(H5N1)]



      Influenza virus A

      HLA-DRB1*04:01, 04:05


      Influenza A virus [A/Puerto Rico/8/1934(H1N1)]

      HLA-DRB1*04:01, 04:05


      Burkholderia mallei ATCC 23344



      Mammalian orthoreovirus

      HLA-DRB1*04:01, 15:01



      Influenza A virus [A/New Caledonia/20/1999(H1N1)]



      Influenza A virus [A/Panama/2007/1999(H3N2)]



      Influenza A virus [A/Singapore/1/1957(H2N2)]



      Influenza A virus [A/California/04/2009(H1N1)]



      Aspergillus fumigatus

      HLA-DRB1*15:01, 15:02


      Influenza A virus [A/chicken/Uchal/8293/2006(H9N2)]



      Influenza A virus [A/chicken/Uchal/8286/2006(H9N2)]



      Influenza A virus [A/swine/Hong Kong/71/2009(H1N1)]


      The dataset of source organism bound to HLA-DRB1 molecules was acquired from the MHC binding assay in the IEDB database.

      In HLA Group B, although some pathogens infect not only humans but also other animals (for example, Brucella ovis and Burkholderia mallei), candidates for human-specific pathogens (for example, Helicobacter pylori) were included. This suggests that some Group B alleles might be also involved in local adaptation in humans.

      The frequency distributions of eight HLA-DRB1 alleles (HLA-DRB1*0301, *08:02, *11:01, *11:02, *11:03, *11:04, *12:01, and *14:01) that recognize Group A-specific pathogens were investigated using information in the NCBI dbMHC database (Additional file 2: Figure S2). The frequency distributions of HLA-DRB1*08:02, *12:01, and *14:01 were high outside Africa, suggesting that the frequency of the DRB1 molecules might have increased since the human species disseminated outside Africa.

      Chimpanzees appear to have lost a relatively large number of alleles from the Group A allelic lineage while humans have maintained several allelic lineages since their speciation. The examination of genetic variation in MHC class I Patr-A, Patr-B, and Patr-C loci suggested that the genetic variations in chimpanzees have been severely reduced [42]. In this previous study, it was hypothesized that a selective sweep caused the loss of genetic diversity at MHC loci in chimpanzees in order to avoid widespread viral infection, such as that with chimpanzee-derived simian immunodeficiency virus, prior to a subspeciation of the common chimpanzee and bonobo (Pan paniscus) approximately two MYA. Although it is not known whether such selective sweep resulted in the loss of some DRB1 allelic lineages in chimpanzees, reduced genetic variation at the three class I loci in chimpanzees may have been linked to the relatively small number of Patr-DRB1 allelic lineages.


      A phylogenetic analysis of the HLA-DRB1 gene identified two major groups of alleles, Groups A and B. Our findings suggest that Group A is human-specific and has been maintained by balancing selection in humans, while chimpanzees may have lost their counterparts to these allelic lineages due to different selective pressure. Some Group A alleles can bind to peptides derived from human specific pathogens and these showed a high frequency in populations outside Africa. Therefore, these alleles may have increased in frequency after the Out-of-Africa event. Our results imply that some of HLA Group A alleles may have contributed to local adaptation of humans.


      In the present study, we identified a candidate human-specific HLA-DRB1 allelic group. However, the sample size of chimpanzees was smaller than that of humans. Specifically, there were at least 88 chimpanzees used in published studies [14, 15, 4345], while the HLA-DRB1 alleles were detected in thousands of human individuals. Therefore, there is possible sampling bias among chimpanzees. The common chimpanzees are classified into at least four subspecies, which are, Pan troglodytes troglodytes, P. t. verus, P. t. ellioti, and P. t. schweinfurthii, in Mammal Species of the World[46]. In addition to the common chimpanzees, bonobo samples should also be included in the phylogenetic analyses of DRB1 alleles. To exclude the possibility that our finding is an artifact of sampling bias, we plan to increase the sample size of chimpanzees in future studies, which will help validate the present estimates.

      In the present study, DRB1 alleles of rhesus monkeys and crab-eating macaques formed a taxon-specific clade with the exception of HLA-DRB4*01 sequences. All sampled alleles in the two macaques formed a sister clade with HLA Group A alleles in the ML tree but not with HLA Group B alleles (Figure 1). In the future, the reason why the DRB1 alleles of macaques formed a large monophyletic group should be investigated.

      It is difficult to verify that a molecule in HLA Group A can recognize human-specific pathogens. In recent years, there has been increasing information on peptide-HLA binding. Future studies must examine the relationships among HLA alleles, binding peptides, and pathogens in order to elucidate the mechanisms by which modern humans have adapted to a variety of environments around the world.

      The contribution of natural selection to local adaptation in humans was evaluated from genomic data. The genomic data provide a universal framework for understanding human evolution and enable quantitative analysis of the operation of natural selection. We believe that molecular genetics techniques can shed light on some important issues in physiological anthropology.



      This work was supported by Grant-in-Aid for Scientific Research on Innovative Areas from the Ministry of Education, Culture, Sports, Science and Technology of Japan (22133007). We owe special thanks to Drs. Naoyuki Takahata and Jun Gojobori for providing valuable comments.

      Authors’ Affiliations

      Molecular and Genetic Epidemiology, Faculty of Medicine, University of Tsukuba
      Department of Evolutionary Studies of Biosystems, The Graduate University for Advanced Studies (SOKENDAI)


      1. Sabeti PC, Reich DE, Higgins JM, Levine HZP, Richter DJ, Schaffner SF, Gabriel SB, Platko JV, Patterson NJ, Mcdonald GJ, Ackerman HC, Campbell SJ, Altshuler D, Cooper R, Kwiatkowski D, Ward R, Lander ES: Detecting recent positive selection in the human genome from haplotype structure. Nature. 2002, 419: 832-837. 10.1038/nature01140.View ArticlePubMed
      2. Hill AV: The immunogenetics of human infectious diseases. Annu Rev Immunol. 1998, 16: 593-617. 10.1146/annurev.immunol.16.1.593.View ArticlePubMed
      3. The MHC sequencing consortium: Complete sequence and gene map of a human major histocompatibility complex. Nature. 1999, 401: 921-923. 10.1038/44853.View Article
      4. Bjorkman PJ, Saper MA, Samraoui B, Bennett WS, Strominger JL, Wiley DC: The foreign antigen binding site and T cell recognition regions of class I histocompatibility antigens. Nature. 1987, 329: 512-518. 10.1038/329512a0.View ArticlePubMed
      5. Brown JH, Jardetzky TS, Gorga JC, Stern LJ, Urban RG, Strominger JL, Wiley DC: Three-dimensional structure of the human class II histocompatibility antigen HLA-DR1. Nature. 1993, 364: 33-39. 10.1038/364033a0.View ArticlePubMed
      6. Hughes AL, Nei M: Nucleotide substitution at major histocompatibility complex class II loci: evidence for overdominant selection. Proc Natl Acad Sci U S A. 1989, 86: 958-962. 10.1073/pnas.86.3.958.PubMed CentralView ArticlePubMed
      7. Hughes AL, Nei M: Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature. 1988, 335: 167-170. 10.1038/335167a0.View ArticlePubMed
      8. Takahata N, Satta Y, Klein J: Polymorphism and balancing selection at major histocompatibility complex loci. Genetics. 1992, 130: 925-938.PubMed CentralPubMed
      9. Satta Y, O’hUigin C, Takahata N, Klein J: Intensity of natural selection at the major histocompatibility complex loci. Proc Natl Acad Sci U S A. 1994, 91: 7184-7188. 10.1073/pnas.91.15.7184.PubMed CentralView ArticlePubMed
      10. Klein J, Sato A, Nikolaidis N: MHC, TSP, and the origin of species: from immunogenetics to evolutionary genetics. Annu Rev Genet. 2007, 41: 281-304. 10.1146/annurev.genet.41.110306.130137.View ArticlePubMed
      11. Takahata N, Nei M: Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics. 1990, 124: 967-978.PubMed CentralPubMed
      12. Satta Y, O’hUigin C, Takahata N, Klein J: The synonymous substitution rate of the major histocompatibility complex loci in primates. Proc Natl Acad Sci U S A. 1993, 90: 7480-7484. 10.1073/pnas.90.16.7480.PubMed CentralView ArticlePubMed
      13. Muchmore EA: Chimpanzee models for human disease and immunobiology. Immunol Rev. 2001, 183: 86-93. 10.1034/j.1600-065x.2001.1830107.x.View ArticlePubMed
      14. Kenter M, Otting N, Anholts J, Jonker M, Schipper R, Bontrop RE: Mhc-DRB diversity of the chimpanzee (Pan troglodytes). Immunogenetics. 1992, 37: 1-11.View ArticlePubMed
      15. Mayer WE, O’hUigin C, Zaleska-Rutczynska Z, Klein J: Trans-species origin of Mhc-DRB polymorphism in the chimpanzee. Immunogenetics. 1992, 37: 12-23.View ArticlePubMed
      16. Bontrop RE, Otting N, de Groot NG, Doxiadis GGM: Major histocompatibility complex class II polymorphisms in primates. Immunol Rev. 1999, 167: 339-350. 10.1111/j.1600-065X.1999.tb01403.x.View ArticlePubMed
      17. Yasukochi Y, Satta Y: Current perspectives on the intensity of natural selection of MHC loci. Immunogenetics. 2013, 65: 479-483. 10.1007/s00251-013-0693-x.PubMed CentralView ArticlePubMed
      18. Robinson J, Mistry K, McWilliam H, Lopez R, Parham P, Marsh SGE: The IMGT/HLA database. Nucleic Acids Res. 2011, 39: D1171-D1176. 10.1093/nar/gkq998.PubMed CentralView ArticlePubMed
      19. Robinson J, Waller MJ, Stoehr P, Marsh SGE: IPD-the immuno polymorphism database. Nucleic Acids Res. 2005, 33: D523-D526.PubMed CentralView ArticlePubMed
      20. Satta Y: Balancing selection at HLA loci. Proceedings of the 17th Taniguchi Symposium. Edited by: Takahata N. 1992, Tokyo: Japan Science Society Press, 111-131.
      21. Kusaba M, Nishio T, Satta Y, Hinata K, Ockendon D: Striking sequence similarity in inter- and intra-specific comparisons of class I SLG alleles from Brassica oleracea and Brassica campestris: implications for the evolution and recognition mechanism. Proc Natl Acad Sci U S A. 1997, 94: 7673-7678. 10.1073/pnas.94.14.7673.PubMed CentralView ArticlePubMed
      22. Yasukochi Y, Kurosaki T, Yoneda M, Koike H, Satta Y: MHC class II DQB diversity in the Japanese black bear: Ursus thibetanus japonicus. BMC Evol Biol. 2012, 12: 230-10.1186/1471-2148-12-230.PubMed CentralView ArticlePubMed
      23. Stern LJ, Brown JH, Jardetzky TS, Gorga JC, Urban RG, Strominger JL, Wiley DC: Crystal structure of the human class II MHC protein HLA-DR1 complexed with an influenza virus peptide. Nature. 1994, 368: 215-221. 10.1038/368215a0.View ArticlePubMed
      24. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.PubMed CentralView ArticlePubMed
      25. Hasegawa M, Kishino H, Yano T: Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J Mol Evol. 1985, 22: 160-174. 10.1007/BF02101694.View ArticlePubMed
      26. Zhang J, Rosenberg HF, Nei M: Positive Darwinian selection after gene duplication in primate ribonuclease genes. Proc Natl Acad Sci U S A. 1998, 95: 3708-3713. 10.1073/pnas.95.7.3708.PubMed CentralView ArticlePubMed
      27. Jukes T, Cantor C: Evolution of protein molecules. Mammalian Protein Metabolism. Edited by: Munro H. 1969, New York: Academic, 21-132.View Article
      28. Vita R, Zarebski L, Greenbaum JA, Emami H, Hoof I, Salimi N, Damle R, Sette A, Peters B: The immune epitope database 2.0. Nucleic Acids Res. 2010, 38: D854-D862. 10.1093/nar/gkp1004.PubMed CentralView ArticlePubMed
      29. Meyer D, Single R, Mack S, Lancaster A, Nelson M, Erlich H, Fernandez-Vina M, Thomson G: Single locus polymorphism of classical HLA genes. Immunobiolology of the Human MHC: Proceedings of the 13th International Histocompatibility Workshop and Conference, Volume I. Edited by: Hansen JA. 2007, Seattle, WA: IHWG Press, 653-704.
      30. Klein J: Origin of major histocompatibility complex polymorphism: the trans-species hypothesis. Hum Immunol. 1987, 19: 155-162. 10.1016/0198-8859(87)90066-8.View ArticlePubMed
      31. Klein J, Sato A, Nagl S, O’hUigin C: Molecular trans-species polymorphism. Annu Rev Ecol Syst. 1998, 29: 1-21. 10.1146/annurev.ecolsys.29.1.1.View Article
      32. Satta Y, Hickerson M, Watanabe H, O’hUigin C, Klein J: Ancestral population sizes and species divergence times in the primate lineage on the basis of intron and BAC end sequences. J Mol Evol. 2004, 59: 478-487. 10.1007/s00239-004-2639-2.View ArticlePubMed
      33. Steiper ME, Young NM: Primate molecular divergence dates. Mol Phylogenet Evol. 2006, 41: 384-394. 10.1016/j.ympev.2006.05.021.View ArticlePubMed
      34. Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, Hobolth A, Lappalainen T, Mailund T, Marques-Bonet T, McCarthy S, Montgomery SH, Schwalie PC, Tang YA, Ward MC, Xue Y, Yngvadottir B, Alkan C, Andersen LN, Ayub Q, Ball EV, Beal K, Bradley BJ, Chen Y, Clee CM, Fitzgerald S, Graves TA, Gu Y, Heath P, Heger A: Insights into hominid evolution from the gorilla genome sequence. Nature. 2012, 483: 169-175. 10.1038/nature10842.PubMed CentralView ArticlePubMed
      35. Takahata N: Evolutionary genetics of human paleo-populations. Mechanisms of molecular evolution. Edited by: Takahata N, Clark A. 1993, Sunderland, Massachusetts: Sinauer Associates, 1-21.
      36. Takahata N, Satta Y, Klein J: Divergence time and population size in the lineage leading to modern humans. Theor Popul Biol. 1995, 48: 198-221. 10.1006/tpbi.1995.1026.View ArticlePubMed
      37. Satta Y: Comparison of DNA and protein polymorphisms between humans and chimpanzees. Genes Genet Syst. 2001, 76: 159-168. 10.1266/ggs.76.159.View ArticlePubMed
      38. Kim HL, Igawa T, Kawashima A, Satta Y, Takahata N: Divergence, demography and gene loss along the human lineage. Philos Trans R Soc Lond B Biol Sci. 2010, 365: 2451-2457. 10.1098/rstb.2010.0004.PubMed CentralView ArticlePubMed
      39. Riley-Vargas RC, Gill DB, Kemper C, Liszewski MK, Atkinson JP: CD46: expanding beyond complement regulation. Trends Immunol. 2004, 25: 496-503. 10.1016/j.it.2004.07.004.View ArticlePubMed
      40. Guiso N: Bordetella pertussis and pertussis vaccines. Clin Infect Dis. 2009, 49: 1565-1569. 10.1086/644733.View ArticlePubMed
      41. Osterhaus A, Rimmelzwaan G, Martina B, Bestebroer T, Fouchier R: Influenza B virus in seals. Science. 2000, 288: 1051-1053. 10.1126/science.288.5468.1051.View ArticlePubMed
      42. De Groot NG, Otting N, Doxiadis GGM, Balla-Jhagjhoorsingh SS, Heeney JL, van Rood JJ, Gagneux P, Bontrop RE: Evidence for an ancient selective sweep in the MHC class I gene repertoire of chimpanzees. Proc Natl Acad Sci U S A. 2002, 99: 11748-11753. 10.1073/pnas.182420799.PubMed CentralView ArticlePubMed
      43. Gyllensten UB, Sundvall M, Erlich HA: Allelic diversity is generated by intraexon sequence exchange at the DRB1 locus of primates. Proc Natl Acad Sci U S A. 1991, 88: 3686-3690. 10.1073/pnas.88.9.3686.PubMed CentralView ArticlePubMed
      44. Bak EJ, Ishii Y, Omatsu T, Kyuwa S, Tetsuya T, Hayasaka I, Yoshikawa Y: Identification and analysis of MHC class II DRB1 (Patr-DRB1) alleles in chimpanzees. Tissue Antigens. 2006, 67: 134-142. 10.1111/j.1399-0039.2006.00539.x.View ArticlePubMed
      45. De Zorzi M, Caggiari L, Ahlenstiel G, Rehermann B, De Re V: Description of two new major histocompatibility complex (MHC) class II DRB1 [Pan troglodytes (Patr)-DRB1] alleles. Tissue Antigens. 2008, 71: 490-492. 10.1111/j.1399-0039.2008.01027.x.View ArticlePubMed
      46. Wilson D, Reeder D: Mammal Species of the World: A Taxonomic and Geographic Reference. 2005, Baltimore: Johns Hopkins University Press, 3

      This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://​creativecommons.​org/​licenses/​by/​2.​0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://​creativecommons.​org/​publicdomain/​zero/​1.​0/​) applies to the data made available in this article, unless otherwise stated.