Introduction

Behçet's disease (BD) is an autoimmune disorder characterized by recurrent ulcers in the mouth, genitalia, and eyes, with potential multi-organ involvement1,2. It is classified as variable vessel vasculitis due to its impact on blood vessels3. The prevalence of BD is higher in countries along the Silk Road, particularly in the Mediterranean and northern East Asia4,5. Epidemiological studies reveal varying rates, such as 420 per 100,000 in Turkey6 and 35 per 100,000 in Korea, marking the highest prevalence among Asian nations7. In Thailand, a 24-year study (1980–2003) documented 23 cases of BD8. Initially believed to be an auto-inflammatory disease, recent studies suggest it is an auto-inflammatory and auto-immune disorder, with innate immune mutations, vascular diseases, and co-morbid conditions contributing to the auto-inflammatory side and adaptive immune system mutations causing the autoimmune disorder9,10,11.

Genetics are the most significant risk factors for BD, with HLA-B*51 allele variations present in 50–60% of patients12. The precise role of HLA-B*51 in Behçet's syndrome remains uncertain, with ongoing debates about whether it is directly involved in the disease or merely acts as a marker. Other HLA and non-HLA alleles have also been linked to BD. For example, the systematic review, encompassing 11 studies, highlights increased TNF-α production, potentially triggered by TLR-signalling, and underscores TNF-α's pivotal role in BD's immunopathogenesis13. A study in Spain found no significant association between Behçet's disease (BD) and multiple ERAP1 polymorphisms, but increased frequencies of these polymorphisms was seen in patients with HLA-B risk suggesting a potential epistatic interaction between ERAP1 and HLA-B14. Many studies reported this higher likelihood of developing BD in HLA-B*51 positive cases along with homozygosity for the ERAP115,16. Additionally, independent risk alleles such as HLA-B  15, HLA-B  27, HLA-B  57, HLA-A  26, HLA-A*02, HLA-A*27, HLA-B*57, MICA-TM-A6 and TNFX103-1C have been identified in the development of Behçet's syndrome17,18.

HLA-B*51 stands out as the most significant genetic factor in Behcet's syndrome, and its prevalence exhibits variations among clinical clusters19. A meta-analysis involving 4800 patients and 289 healthy controls suggested a 32–52% attributable risk for HLA-B*5112. While numerous HLA alleles have been reported as susceptible in various Asian and European countries, limited studies have explored their association with BD in the Southeast Asian (SEA) population. Earlier studies in Thailand identified HLA-B*51:01 as a major susceptible allele in the Thai population, with HLA-A*26:01 reported as a risk allele in non-carriers of HLA-B*51:0120. BD's most prevalent clinical manifestations involve the skin (38–99%), mucous membranes (mouth 47–86%, genitals 57–93%, eyes 30–70%)21,22, joints (45–60%), cardiovascular system (CVS 9–49%), neurological system (5–10%), and gastrointestinal system (3–26%). Poor prognosis23 is associated with BD patients having ocular24, cardiovascular, central nervous system (CNS), or gastrointestinal involvement. Risk alleles for specific clinical manifestations or phenotypes may vary; for example, a Korean study identified HLA-A*02:07 as a risk allele for skin lesions and arthritis, and HLA-A*26:01, HLA-A*30:04, and HLA-B*51:01 as risk alleles for uveitis, vasculitis, and BD, respectively25.

High-resolution HLA typing is instrumental in pinpointing specific HLA alleles, offering a more precise understanding of an individual's HLA profile and its correlation with BD. Notably, a study in Turkey identified a susceptibility association with HLA-B*51 carriers, while HLA-B*52 was found to have a protective effect26. Similarly, an Italian study revealed a high frequency of the HLA-B*51 allele (62.7%), with HLA-B*51:01 and HLA-B*51:08 being the most common subtypes. The occurrence of ocular involvement was statistically linked to HLA-B*51 positivity and the HLA-B*51:01 and HLA-B*51:08 subtypes27. Another NGS study from Egypt implicated HLA-B*51:08 in legal blindness among BD patients28. Overall, high-resolution HLA typing offers a more detailed and clinically relevant perspective on an individual's HLA alleles, enhancing medical decision-making and enabling personalized treatment strategies for BD. This study in Thailand is among the first to explore the association of both class I and II HLA molecules with BD in high resolution (6-digits) using NGS technology.

Results

Patients’ demographic data and clinical characteristics

According to the ICBD criteria, 17 (30.4%) of the fifty-six BD patients had 4 scores, 24 (42.3%) had 5 scores, and 15 (26.8%) had 6 scores. Twenty-three (41.1%) patients were males, and 33 (58.9%) were females, with an average age of onset of 35.5 years (range age of onset: 12–54 years). Ocular, vascular, skin, CNS, and GI systems are the most involved systems in BD. All BD patients had oral ulcers (56, 100%), followed by genital ulcers 34 (60.7%), uveitis 19 (33.9%), retinal vasculitis 15 (26.8%), epidermal necrolysis (EN) 20 (35.7%), papulopustular 20 (35.7%), leg ulcers 1 (1.8%), vasculitis 8 (14.3%), and arthritis 17 (30.4%). In this study, 23 (41.1%) patients had ocular involvement, which included uveitis and retinal vasculitis. Twenty patients (or 35.7%) had vascular involvement, including vasculitis (thrombophlebitis and deep vein thrombosis) and retinal vasculitis. Epidermal necrolysis (EN), papulopustular lesions, and leg ulcers were the skin involvements that were observed in 31 patients (55.4%). The CNS and GI systems were involved in 3 (5.36%) and 6 (10.7%) patients, respectively. Ten patients (16.1%) had a positive allergy test. Of the one hundred and ninety-two controls, 111 (57.8%) were male and 81 (42.2%) were female, with an average age of 60.8 years (range 60–74 years). Case and control characteristics and clinical data are summarised in Table 1.

Table 1 Demographics and clinical characteristics of the study population.

The association between various HLA Class I and HLA Class II molecules using next generation sequencing (NGS)

The univariate analysis of this study showed that HLA-A*26:01:01 had an association with BD. This allele was carried by 4.7% of control and 12.5% of case groups respectively (OR-3.285, P-value = 0.028, 95% CI = 1.135–9.504). Other HLA-B alleles that have been shown to be strongly associated with BD that includes HLA-B*39:01:01 (OR = 6.16, 95% CI = 1.428–26.712, P-value = 0.015), HLA-B*51:01:01 (OR = 3.033, 95% CI = 1.135–8.103, P-value = 0.027), and HLA-B*51:01:02 (OR = 6.176, 95% CI = 1.428–26.712, P-value = 0.005). In HLA-C, HLA-C*14:02:01 was also found to be associated with BD (OR = 3.485, 95% CI = 1.339–9.065, P-value = 0.001) (Table 2). The univariate analysis of HLA Class II alleles such as HLA-DRB1*14:54:01 (OR 1.924, 95% CI = 1.051–3.522, P-value = 0.034) and HLA DQB1*05:03:01 (OR = 3.00, 95% CI = 1.323–6.798, P-value = 0.008) were found to be associated with BD as well (Table 3). For the multivariate analysis, P-value were adjusted by using Bonferroni’s correction (34 for HLA-A; HLA-A*26:01:01:01 with Pc-value = 0.952, 49 for HLA-B; HLA-B*39:01:01 and HLA-B*51:01:02 with Pc-value = 0.735, HLA-B*51:01:01 with Pc-value = 1.323, 24 for HLA-C; HLA-C*14:02:01 with Pc-value = 0.240, 27 for HLA-DRB1; HLA-DRB1*14:54:01 with Pc-value-0.918, and 16 for HLA-DQB1; HLA-DQB1*05:03:01 with Pc-value = 0.128), there have no HLA alleles shown associated with BD.

Table 2 Analysis of various HLA Class I molecules and BD associations using NGS.
Table 3 Analysis of various HLA Class II molecules and BD associations using NGS.

Haplotype analysis

To elucidate the association among different HLA alleles across multiple loci, pairwise haplotype analysis was conducted. The analysis identified a linkage disequilibrium in the haplotypes HLA-A*24:02:01, HLA-B*51:01:01, and HLA-C*14:02:02 (A–B–C, P-value = 0.033, Pc-value = 0.066). Subsequent pairwise haplotype analyses were performed, revealing a significant correlation in the B–C pair (HLA-B*51:01:01; HLA-C*14:02:01, P-value = 0.01, Pc-value = 0.02) but not in the A–C pair (HLA-A*24:02:01, HLA-C*14:02:01, P-value = 0.45).

Susceptible HLA alleles associated with the ocular/vascular phenotype of Behcet’s disease (BD)

The alleles linked to patients with BD exhibiting ocular involvement were investigated in this study. Among the various reported alleles, the HLA-B*51:01 allele emerged as the most significant risk allele associated with BD patients with ocular manifestations (p = 0.02). At the 6-digit resolution, a noteworthy association was observed between the HLA-DRB1*14:54:01 (OR 11.67, 95% CI 2.86–47.57, p = 0.001) and BD patients with ocular involvement. Conversely, individuals with BD and vascular complications exhibited a higher prevalence of HLA-DRB1*14:54:01 alleles (OR 3.352, 95% CI 1.00–11.19, p = 0.049). However, it is found to be statistically insignificant.

Discussion

This study employed six-digit genotyping to discern risk alleles associated with Behçet's disease (BD) patients, focusing on susceptible allele subtypes and specific phenotypes such as ocular and vascular involvement in Thai BD patients. Our findings of univariate analysis underscore HLA-B*51:01:02, HLA-B*39:01:01, HLA-C*14:02:01, and HLA-DQB1*05:03:01 alleles as the primary risk alleles in BD patients, a correlation consistent with numerous earlier studies across diverse ethnicities. HLA-B*51:01 emerges as the predominant allele associated with BD across diverse ethnicities, underscoring its pivotal role in BD susceptibility. The novel discovery of HLA-B*51 subtypes reflect the genetic diversity within this allele group, suggesting a complex landscape of variations. Recent advancements in genetic research have led to the identification of novel variants such as HLA-B*51:94, HLA-B*51:151, and HLA-B*51:220, each characterized by distinct nucleotide substitutions. Notably, subtype HLA-B*51:08 exhibits amino acid variations compared to HLA-B*51:01, particularly at positions 152 and 156 within pocket E of the HLA molecule. These unique amino acid signatures differentiate HLA-B*51:08 from other subtypes and may influence its association with BD risk. Together, these findings underscore the intricate genetic architecture of HLA-B*51 alleles and their significance in BD susceptibility across different populations27. Notably, a genome-wide association study (GWAS) conducted in Spain identified independent risk alleles, including HLA-B*51:01 and HLA-A*26:0114. Similarly, studies in Turkey and Saudi Arabia confirmed HLA-B*51:01 as a predominant genetic marker in BD patients29,30 Greek research identified the involvement of the MICA-TM A6 allele and HLA-B*51:01 in Behçet's Disease within a European population. Furthermore, research conducted in Spain, comprising 278 BD patients and 1517 healthy individuals, consistently emphasized the significant association of HLA-B*51:01 and HLA-A*03:01. These results were subsequently validated in multiple studies within the same population. The presence of HLA-B*51:01 has also been observed in the Argentinian BD population26, reinforcing its status as the predominant allele in BD patients, irrespective of ethnic background or clinical phenotype. In this study, further subtype analysis of HLA-B*51:01 revealed HLA-B*51:01:02 as the most significant subtype associated with BD. However, the Bonferroni’s correction (multiple variable analysis) we have performed in this study, hasn’t found any association between HLA-B*51:01:02, and BD. This could be attributed to the limited sample size or high data variability, leading to associations observed in univariate analysis failing to reach significance in multivariable analysis.

Nevertheless, achieving a comprehensive understanding of BD diagnosis requires examination of HLA and non-HLA genetic variants, as well as consideration of environmental factors. This is particularly crucial in populations lacking the most common allele, HLA-B*51:01. Previous studies have reported numerous risk alleles, both within the HLA system and beyond. In Thailand, an in-silico analysis revealed the strongest binding affinity for HLA-B*51:01, followed by HLA-B*35:01, HLA-A*26:01, and HLA-A*11:0127. A Korean study, encompassing 223 BD patients and 1398 healthy controls, observed a higher prevalence of the HLA-A*02:07, HLA-A*26:01, and HLA-A*30:04 alleles in BD patients compared to controls19. Notably, this association had odds of 4.19 or greater among patients lacking the HLA-B*51 allele. In Japanese study, HLA-A*26:01 was identified as a risk allele, particularly noteworthy in patients lacking the HLA-B*51:01 allele, and their findings suggested an association with a poor prognosis28. This observation was consistently reported in another GWAS conducted within the same population29. In Middle Eastern countries, Saudi Arabia specifically reported HLA-A*26:01 as a risk allele, alongside others such as HLA-B*51:01 and HLA-A*31 associated with BD22. While HLA-A*26:01 has frequently been identified as a predisposing allele for BD across various populations, in this study, although initially significant, this association lost statistical significance following Bonferroni correction for type 1 error.

In the European population, HLA-B*27 and its subtypes have been linked to BD. Alireza Khabbazi et al.31 conducted a meta-analysis study, establishing a significant relationship between HLA-B*27 and BD across various populations. Ethnic variations were noted, with the prevalence of the HLA-B*27 allele being higher in European populations30. Another meta-analysis study by Capittini et al.32 encompassing diverse global populations, reported HLA-DQB1*03 and HLA-A*26, in addition to HLA-B*51:01 and HLA-B*51:02, as genetic factors associated with BD.

Although previous studies did not find any association between HLA-B*39:01:01 and BD in Thai and other populations, a study conducted in Japan reported this allele's association with familial Mediterranean fever (FMF) in Japanese patients33. Haplotype analysis and linkage disequilibrium are critical in HLA allele and disease association studies. The high genetic diversity within the HLA region necessitates an understanding of how specific alleles are inherited together on the same chromosome. In HLA studies, these analyses offer precision in identifying risk alleles, shedding light on the complex genetic basis of diseases, and facilitating targeted therapeutic approaches for personalized medicine. The results of our pairwise haplotype analysis shed light on the intricate associations among different HLA alleles in the context of BD. Notably, the analysis revealed a significant linkage disequilibrium in the HLA-A*24:02:01, HLA-B*51:01:01, HLA-C*14:02:02 haplotypes (A–B–C, P-value = 0.033, PC-value = 0.066), emphasizing the coinheritance of specific alleles on the same chromosome. This haplotype was reported in one of the Indian studies, especially in south Indians, with a haplotype frequency of 0.57%34. The same haplotype was reported in HLA allele database in the Chinese in Chinese Jingpo minority at 1.04%35. However, this is the first time in the Thai population that this haplotype has been found to be associated in BD patients. Further exploration of pairwise haplotypes led to the identification of a noteworthy correlation within the B–C pair (HLA-B*51:01:01; HLA-C*14:02:01, P-value = 0.01, Pc-value = 0.02). This finding suggests a potential synergistic effect or shared genetic influence between the HLA-B*51:01:01 and HLA-C*14:02:01 alleles in BD. Therefore, when we considered with HLA-B*51:01:01 (OR = 3.033, 95% CI = 1.135–8.103, P-value = 0.027) and HLA-C*14:02:01(OR = 3.485, 95% CI = 1.339–9.065, P-value = 0.001) were associated with BD. HLA-B*51:01:01 can be useful for screening marker in Thai population. In contrast, the A–C pair (HLA-A*26:01:01, HLA-C*03:02:02) did not exhibit a statistically significant correlation (P-value = 0.45), suggesting that the observed linkage disequilibrium may not extend to these particular alleles or that other factors influence their inheritance patterns. These results underscore the complexity of HLA associations in BD and highlight the importance of considering specific allele combinations in understanding the genetic basis of the disease. A worldwide meta-analysis investigating the link between BD and variations in genes of both HLA Class I (A, B, and C) and Class II (DRB1, DQB1, and DPB1) has confirmed that the HLA-B  51;Cw  15 and HLA-B  51;Cw  14 haplotypes were the second and third most frequent haplotypes, respectively, while the HLA-B  51; Cw  16 haplotype was in sixth place32. Further studies with larger cohorts and diverse populations are warranted to validate and extend these findings, providing additional insights into the genetic architecture of BD.

This study's primary limitation was its small sample size, limiting exploration of HLA relationships with CVS, GI, and neurological phenotypes. The exclusive focus was on the association between HLA-B alleles and BD, excluding a broader analysis of all HLA alleles or other phenotypes.

Conclusion

Our research highlights specific HLA associations in Behçet's disease (BD), particularly the significant relation of HLA-DRB1*14:54:01 with ocular manifestations. Comprehensive haplotype analysis revealed a correlation between HLA-B*51:01:01 and HLA-C*14:02:01 suggests a potential synergistic effect in BD. These insights underscore the complexity of HLA associations in BD and stress the importance of considering specific allele combinations for a nuanced understanding. When we considered association between this allele and BD. Therefore, HLA-B*51:01:01 can be useful for screening marker in Thai population. Further studies with diverse populations are crucial for validating these findings and advancing targeted therapeutic approaches in personalized medicine.

Material and methods

Study population

Patients were recruited at Mahidol University's Faculty of Medicine and Ramathibodi Hospital's dermatology and ophthalmology clinics between 2013 and 2018. Patients who scored greater than or equal to four on the ICBD36 criteria and were diagnosed with BD by a rheumatologist and a dermatologist were included in this study. Overall, 56 BD patients were enrolled in the study. Their clinical characteristics and sociodemographic details were obtained from medical records. This study includes a control group from the Electricity Generating Authority of Thailand (EGAT) project.

Genomic DNA extraction

Blood samples were taken in EDTA tubes. DNA was isolated using magnetic-bead technology on the Roche Diagnostics, USA, MagNA Pure Compact automated extraction equipment. The genomic DNA's quality and quantity were analysed using Nano Drop (ND-1000). All DNA was aliquoted and stored at − 20 °C before analysis. The quantity (concentration) of DNA was measured by Qubit® Fluorometer 2.0 (Life Technologies) with Qubit® dsDNA HS Assay Kits (Life Technologies), and the quality (size) of DNA was run with 100–200 mg of known DNA on 1% agarose gel electrophoresis compared with O’GeneRuler 1 kb DNA Ladder (Thermo Scientific). DNA was selected if the concentration is over or equal to 25 ng/µl and the size of DNA was more than 10 kb.

HLA genotyping

HLA class I (HLA-A, -B, C) and HLA class II (HLA-DRB1, -DQB1, DPB1) amplicons were generated using the NXTypeTM reagents. And the amplicons were amplified according to One Lambda, USA protocol for PCR amplicons greater than 3 kb. Afterwards, the amplicons underwent quantity check using Qubit® dsDNA HS Assay Kits (Life Technologies), and DNA 12K assay, DNA Extended Range Chip (12K) (PerkinElmer, USA).

HLA class I and class II amplicons (6-types) were pooled together in equimolar proportions and subsequently purified with AMPure® PB (Pacific Biosciences, USA). SMRTbell libraries were then generated using the SMRTbell Barcoded Adapter Complete Prep kit–96 (Pacific Biosciences, USA). Individual samples, consisting of pooled HLA amplicons, were end-repaired, and ligated to unique SMRTbell barcoded adapters in a single reaction. Following ligation, the libraries were pooled and further purified using AMPure® PB. The resulting purified libraries were prepared for sequencing with the Sequel Binding and Internal Control Kit 3.0 (Pacific Biosciences, USA) and were subsequently sequenced on PacBio Sequel System (Pacific Bioscience, USA). Sequencing was performed using the Sequel sequencing kit 3.0 and SMRTcell 1M v3 (Pacific Bioscience, USA) with 10 h movie time.

Demultiplexed, high-quality consensus sequences (Long Amplicon Analysis; LAA reads) of each sample were generated in FastQ from raw sequence data using the accompanying analysis software suite, SMRT Link version 9.0, with given parameters of at least ten passes and 99.9% accuracy.

For downstream analysis, a custom full-length HLA reference alleles was created from HLA class I (HLA-A, -B, -C) and HLA class II (HLA-DRB1, -DQB1, -DPB1) reference allele sequences (IMGT/HLA release 3.38.0). Due to lack of standardized analysis guidelines for long-read HLA data, custom analysis pipeline was used (“Supplementary Information”).

Briefly, for each sample, the consensus LAA reads were aligned against the custom full-length HLA reference alleles using minimap2 version 2.837. From the alignment outputs, numbers of LAA reads fully mapped to each HLA alleles are generated and the list of ‘hit’ HLA alleles for each HLA gene was sorted according to highest numbers of mapped LAA reads. For each sample, a maximum of four candidate HLA alleles per HLA gene were selected based on the highest numbers of mapped LAA reads and at least 50 mapped LAA reads. In most cases, there were two candidate HLA alleles (sometimes, only one) per HLA gene per sample with much higher number of mapped LAA reads. These HLA alleles were then assigned as the HLA alleles within the sample (HLA genotyping).

To assure the accuracy of assigned HLA alleles whether each assigned HLA allele was a known HLA allele or probably a novel HLA allele, a representative consensus sequence obtained from the LAA sequences matching each candidate HLA allele was aligned against the current version of reference HLA alleles in IMGT/HLA database, using the IPD-IMGT/HLA Sequence Alignment Tool on the EBI website; https://www.ebi.ac.uk/ipd/imgt/hla/alignment/. The finalized candidate HLA alleles were assigned to each sample for each of the six HLA class I and class II genes.

Statistical analysis

HLA alleles were tested for their association with BD by calculating the Odds ratio; 95% Confidence Interval using Fisher’s exact test. STATA version 12 for Windows was used to analyses all tests. The corrected P-values (Pc-value) for multiple comparison of HLA alleles (34 for HLA-A, 49 for HLA-B, 24 for HLA-C, 25 for HLA-DPB1, 27 for HLA-DRB1, and 16 for HLA-DQB1) were calculated using Bonferroni’s correction. P-value were less than 0.05 was considered statistically significant. To reduce the likelihood of type 1 error, Bonferroni correction was applied. Following this correction, a significance threshold of p = 0.01 (< 0.05/4—two-tailed) was adopted. Haplotype association analysis was carried out using ‘haplo.stats’ R Studio software packages.

Ethics approval

Our study strictly adheres to the Declaration of Helsinki's principles. Participants receive a clear and understandable patient information sheet along with a comprehensive informed consent form before participating. We exclude vulnerable groups (children, pregnant women, prisoners, and those with diminished decision-making capacity). Our research protocol underwent rigorous ethical review by an accredited board, which approved it based on the highest ethical standards by the Committee for Research, Faculty of Medicine Ramathibodi Hospital Mahidol University (COA. MURA2020/16).