Research - (2023) Volume 8, Issue 2
Received: 15-Mar-2023, Manuscript No. JFM-23-91785;
Editor assigned: 17-Mar-2023, Pre QC No. P-91785;
Reviewed: 29-Mar-2023, QC No. Q-91785;
Revised: 04-Apr-2023, Manuscript No. R-91785;
Published:
12-Apr-2023
, DOI: 10.37421/2472- 1026.2023.8.192
Citation: Iliev, Plamen, Vera Djeliova, Ekaterina Angelova and Bogdan
Mirchev, et al. “Allelic Frequencies and Statistical Parameters Determination, for 16
Autosomal STR Markers Included in the NGM Detect™ PCR Amplification Kit, for the
Bulgarian Population.” J Forensic Med 8 (2023): 192.
Copyright: © 2023 Iliev P, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
In the presented study, the allelic frequencies and statistical parameters for 500 unrelated individuals from the 28 administrative regions of the Republic of Bulgaria were determined for 16 autosomal genetic markers (STR), included in the NGM Detect™ PCR Amplification Kit (Applied Biosystems), for the Bulgarian population. No deviations from Hardy-Weinberg equilibrium for the 16 studied loci were observed, except for a deviation for the D8S1179 and FGA loci. The most polymorphic and informative for the Bulgarian population is the SE marker, with the next most informative markers being: D1S1656, D12S391 and D18S51. For all 16 loci, the combined value Match Probability (MP) is 1.20e-21 and the power of exclusion (PE) is 99.999999022%.
The results obtained from the DNA analysis have a very high identification value. Their interpretation is particularly important when a DNA profile found at a crime scene matches that of a suspect or an injured person. Such statistical estimates are also valuable in detecting mixed DNA profiles, as well as in calculating the probability of kinship between two or more individuals. For these calculations, it is necessary to perform a statistical analysis of the occurrence frequency of the specified STR variants among a given population group of individuals. So far, no such research representative of the Bulgarian population has been published.
DNA analysis • STR • Bulgarian population data • STR population database • NGM Detect™ PCR amplification kit
Short tandem repeats (STRs) form approximately 3% of the total human genome and occur on average once in every 10,000 nucleotides [1]. When applying a DNA-based identification method, determining the allelic frequencies for the target STR loci and their statistical parameters for each population, is essential to provide a more accurate reference database for the forensic investigation. Estimating the STR allele frequencies is an essential prerequisite for this type of analysis to be applicable in forensic practice. This is because each population has its own composition of allele distribution. STR allele frequencies estimation of the target population ensures correct weightof- evidence calculations in court practice, including when deciding cases of disputed parentage.
The goal of the present study is to determine the allele frequencies and their statistical parameters for the Bulgarian population for 16 autosomal genetic markers included in the NGM Detect™ PCR Amplification Kit: D2S1338, SE33, D16S539, D18S51, TH01, D12S391, D3S1358, FGA, vWA, D21S11, D1S1656, D2S441, D8S1179, D19S433, D22S1045, D10S124, by deriving the genetic profiles of 500 unrelated individuals from the 28 administrative districts of the Republic of Bulgaria.
With an increasing number of laboratories using STR marker technology, more and more STR data of the population is being reported in Europe and around the world, allowing regional genetic comparison of the population. The diversity of STRs in the genome is the result of DNA polymerase slippage during the replication of the DNA, or mismatch of the basic group of slippage strand and complementary strand in the process of DNA replication and repair, resulting in one or more missing repetitives or inserted units [2-4].
In the present study 500 unrelated individuals from the Bulgarian population, from the 28 administrative regions of the Republic of Bulgaria, were examined. The analyses were conducted in the DNA laboratory at the University Hospital "Alexandrovska"-Sofia and covered the period from 2017 to 2022. Declarations of informed consent were signed from the examined living persons for the samples obtained. In the declaration of informed consent, only the regional origin of the volunteer is disclosed, in compliance with the legislative requirements for control when working with personal data of by the Republic of Bulgaria and the EU. When expert samples were provided to us by the investigating authorities for performing forensic examinations for the identification of the perpetrator of a criminal act, or the establishment of kinship or the identification of corpses of unknown identity, these forms were not applicable. The research has been conducted in accordance with the Helsinki declaration and approved by the ethics committee of the Medical University of Sofia (Protocol No 06/16.05.2022).
Sixteen autosomal genetic markers (STR) included in the NGM Detect™ PCR Amplification Kit (Applied Biosystems) were used to determine the allelic frequencies and statistical parameters of the 500 said individuals: D2S1338, SE33, D16S539, D18S51, TH01, D12S391, D3S1358, FGA, vWA, D21S11, D1S1656, D2S441, D8S1179, D19S433, D22S1045, D10S124.
The DNA profiles were isolated by Chelex100 extraction [5] from buccal mucosa samples of living individuals and from biological samples taken during routine forensic practice.
Multiplex PCR amplification was performed with 0.5 ng of genomic DNA for 16 somatic loci and the Amelogenin sex-determining system, according to the manufacturer's instructions, performed in two steps at:
• Real-Time PCR system 7500 (Life Technologies) with PC Notebook, for HID analyses-with Quantifiler™ Trio DNA Quantification Kit and quantitative evaluation of the DNA present in the samples by HID Real-Time PCR Analysis Software v1.2.
Fragment analysis was performed on a Genetic Analyzer model 3500 Series Genetic Analyzers for Human Identification (Life Technologies) by 8 capillary electrophoresis (with 3500 POP- 4™ Polymer) with laser detection of the fragments and computer analysis using Gene MapperTM v1.2 Full Software (Life Technologies) for HID analysis.
The control and standardization of analyzes was carried out by:
• Positive control-DNA Control 007; negative control-NS; Matrix Standart DS-33 3500 Series (6- FAM™, VIC™, NED™, PET™, LIZ™ dyes); internal standard-GeneScan™ 600 LIZ ™ Size Standart v2.0; internal quality control markers-IQCS and IQCL; an allelic witness (NGM Detect™ Allelic Ladder) for the relevant STR markers, validated and embedded in the NGM Detect™ Kit (Applied Biosystems).
Statistical analyzes of the generated DNA profiles were performed with analysis programs: FORSTAT- Forensic statistics analysis toolbox (https://fdluwc.shinyapps.io/forstat) [6] and STRAF software Version 1.0.5. http://cmpg.unibe.ch/shiny/STRAF/ [7,8].
The following were analysed for: expected heterozygosity (H exp), observed heterozygosity (H obs), homozygosity (Ho), power of discrimination (PD), polymorphism information content (PIC), match probability (MP), exclusion power (PE), typical paternity index (TPI) and for a test of Hardy- Weinberg equilibrium (pHW).
For the statistical analysis, 500 STR profiles were used, taken from samples of buccal mucosa of living persons and expert samples taken during routine forensic practice from the 28 administrative regions of the Republic of Bulgaria. The research was carried out in the DNA laboratory at the Department of Forensic medicine and Deontology at the University Hospital "Aleksandrovska"- Sofia. In order to enter the generated DNA profiles into an information array, the Familias software, v.3.2.8 (https://familias.no) [7] was used.
The allele frequencies of 16 autosomal genetic markers (STR) for the Bulgarian population (n= 500), calculated using statistical programs, are presented in tabular and graphical form, respectively, in Figure 1 and Figure 2 and the calculated statistical parameters are summarized in Figure 3.
Figure 2. Statistical parameters of 16 STR loci in the Bulgarian population (n=500). N: number of alleles for 500 individuals in locus, N all: number of observed alleles in locus, Hexp: Expected Heterozygosity Ho: Observed Heterozygosity; Ho: Homozygosity; PD: Power of Discrimination, PIC: Polymorphism Information Content, MP: Matching Probability; PE: Power of Exclusion; TPI: Typical Paternity Index, pHW: Hardy-Weinberg equilibrium exact test (number of permutations: 1000).
When analyzing the established allelic variants of the analyzed markers for the Bulgarian population, it was established that markers SE33, FGA, D21S11, D12S391, D19S433 and D1S1656 show greater variability. Marker SE33 was found to present 47 different alleles. FGA and D21S11 presented 18 allelic variants and markers D12S391, D19S433 and D1S1656 presented 17 respectively. On the other hand, those with the lowest number of alleles detected in the population were the loci: TH01 with six alleles and D16S539 with eight allelic variants.
The average value of Hobs is 0.8291 and the value of Hesp is 0.8236. The highest Hobs index was obtained in marker SE33 (0.952), followed by D18S51 (0.9) and D1S1656 (0.896). With the highest value of Hesp was SE33 (0.9459), followed by D1S1656 (0.898) and D12S391 (0.8835). As for the lowest indices obtained, in the Hobs index these are markers D2S441 and D22S1045 (0.752) and the same markers in the Hesp index: D2S441 (0.7433) and D22S1045 (0.7311).
PD is characterized by the probability that two unrelated and randomly selected individuals can be genetically differentiated by an analysis of a marker or set of markers. Analyzing the results shown in Figure 1, it can be seen that the PD index for marker SE33 is 0.9927, which is the highest, followed by D1S1656 (0.9795) and D12S391 (0.9738). The markers with the lowest PD index were D2S441 (0.891) and D22S1045 (0.8889). The average PD of the markers is 0.939875.
The PE parameter allows us to determine the proportion of individuals falsely included in an expert assessment. The most informative marker was found to be SE33 (0.9023), followed by D18S51 (0.7954) and D1S165 (0.7873), while the least informative markers were found to be D22S1045 and D2S441 (0.5132). The combined Power of Exclusion (PE) is 0.99999999022.
The three most informative STRs for the Bulgarian population, due to their PIC, were found to be SE33 (0.9433), followed by D1S1656 (0.889) and D12S391 (0.8725). The markers with the lowest index were D22S1045 (0.6898) and D2S441 (0.7025). Allele frequencies of all 16 studied autosomal STR markers have PIC values higher than 0.5, which meets the condition of being sufficiently informative for the studied population [9].
Match Probability (MP) values ranged from 0.0073 (SE) to 0.1111 (DS221045) with a combined value of 1.20e-21. The marker with the highest match probability index was D22S1045 (0.1111), followed by D2S441 (0.1090) and D10S1248 (0.0897), while the marker with the lowest index was SE33 (0.0073), followed by D1S1656 (0.0205) and D12S391 (0.0262).
Typical Paternity Index (PItypical) ranges from 2.0161 (DS441 and DS221045) to 10.4167 (SE33).
The Hardy-Weinberg equilibrium test (p-value>0.05) showed a bias in equilibrium in two genetic markers: D8S1179 (p=0.003) and FGA (p=0.011). However, this deviation was not significant after applying the Bonferroni correction [10].
After the analysis of the 16 STR markers contained in the NGM Detect™ PCR Amplification Kit, it was found that the most polymorphic and informative locus for Forensic Identification of the Bulgarian population is SE33. This marker has the highest PIC, PD, PE, PI and the lowest frequency of MP and homozygosity. For the Bulgarian population, the next most informative markers are: D12S391, D1S1656 and D18S51. These markers have the highest PIC, PD, PE, PI and the lowest value of MP and homozygosity. The lowest levels of PIC, PD, PE, PI and the highest values of MP and homozygosity were found for two markers: D2S441 and D22S1045. All studied loci except D8S1179 and FGA were in the Hardy-Weinberg equilibrium.
The results obtained from the DNA analysis have a very high identification value and their interpretation is particularly important when a DNA profile found and seized from a scene of a crime matches that of a suspect or injured person. In these cases, it is mandatory to calculate what is the statistical probability that another, randomly selected person from the population, apart from the specified one, has the same profile. Such statistical estimates are also valuable in the detection of mixed DNA profiles from two or three individuals, the identification of unidentified individuals and the calculation of the probability of kinship. For these calculations, it is necessary to perform a statistical analysis of the occurrence frequency of the specified variants of the short tandem repeats among a given population group of individuals. As until now, such representative study of the Bulgarian population has not yet been published.
With the growing number of laboratories that use STR technology, more and more data collected in Europe and around the world is being reported on the ST distribution among the population, which allows comparison between regional populations, including the Bulgarian one.
Of the 16 genetic markers examined, the most polymorphic and informative locus for forensic identification was found to be SE33. For the Bulgarian population, the next most informative markers are: D1S1656, D12S391 and D18S51. The lowest levels of PIC, PD, PE, PI and the highest values of MP and homozygosity were found for two markers: D2S441 and D22S1045. All studied loci except D8S1179 and FGA were in Hardy-Weinberg equilibrium. However, this deviation was not significant after applying the Bonferroni correction.
We believe that by carrying out a DNA population study for the Bulgarian population for the frequency of occurrence of alleles and calculation of statistical parameters in established genetic loci, as established in world practice, that that significally contributes to the collection and construction of the world DNA characteristics library of the population. Such library is of great value for the purposes of forensic and criminological investigations for the identification of corpses with unknown identity, as well as for the resolution of cases of disputed parentage. Thus, this data should be popularized and be used by other countries when conducting comparative studies of DNA profiles of persons from the Bulgarian population. This data is especially valuable due to the possibility of free movement of the Bulgarian citizens and when combined with the data from relatively significant Bulgarian diasporas in various countries around the world, that would, in cases of need, be of essential help when comparative identification studies must be conducted. The availability of DNA population data will ensure the accuracy of the mathematical calculations of the results of these studies, as well as their probative value.
This study was realized thanks to a program for financing scientific research - competition "GRAND- 2022" of the Medical University - Sofia, Bulgaria /Contract № D- 176/14.06.2022/.
The authors declare that they have no competing interests. The funders had no role in the study design; in the collection, analysis or interpretation of data; in writing the manuscript; or in the decision to publish the results.
None.
Google Scholar, Crossref, Indexed at
Google Scholar, Crossref, Indexed at
Google Scholar, Crossref, Indexed at
Google Scholar, Crossref, Indexed at
Google Scholar, Crossref, Indexed at
Journal of Forensic Medicine received 165 citations as per Google Scholar report