Fisheries and Aquatic Sciences

The Korean Society of Fisheries and Aquatic Science

Fish Aquat Sci 2025; 28(3):152-162

eISSN: 2234-1757

DOI: https://doi.org/10.47853/FAS.2025.e14

RESEARCH ARTICLE

Molecular and phylogenetic analysis of Sardines at the fish landing center-Tanjung Luar-east Lombok using DNA sequences of the CO1 gene

H. Mahrus Mahrus¹^,^*

, Karnan Karnan¹

, Prapti Sedijani¹

¹Program of Biology Education, Department of Sciences Education, Faculty of Teacher Training and Education, University of Mataram, Mataram 83125, Indonesia

^*Corresponding author: H. Mahrus Mahrus, Program of Biology Education, Department of Sciences Education, Faculty of Teacher Training and Education, University of Mataram, Mataram 83125, Indonesia, Tel: +62-370-623873, Fax: +62-370-634918, E-mail:mahrus@unram.ac.id

Copyright © 2025 The Korean Society of Fisheries and Aquatic Science. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Received: Sep 08, 2024; Revised: Oct 13, 2024; Accepted: Oct 28, 2024

Published Online: Mar 31, 2025

Abstract

Sardine (Sardinella spp.) of the Clupeidae family is known as a fish with high economic value because it contains high omega-3 functioning to maintain heart and brain health. It has many similarities that are difficult to distinguish between one species and another. One method used to analyze kinship relationships based on similarity of characters is phylogenetics. Groups of organisms with similar characteristics are to have a close relationship biologically. This study aims to validate the Sardine species and its phylogenetic relationship with various Sardines using the mitochondrial cytochrome c oxidase subunit 1 (CO1) gene sequences. The Sardine sample used in this study was a Sardine caught by small-scale fishermen from several locations on Lombok Island and landed in the fish landing center of Tanjung Luar. The total fish samples amounted to 100 individuals using random methods. Firstly, the Sardine samples were grouped based on morphological characters. Next, the extraction and amplification of the DNA of 20 Sardines has the potential to be a candidate DNA marker for Sardines pecies using a commercial kit (DNeasy Blood & Tissue Kit, Jena Bioscience, Jena, Germany). The amplification of CO1 rRNA target genes utilized the universal primer pair successfully found 688 bp and was sequenced. The CO1 gene sequence of Sardines was analyzed using Molecular Evolutionary Genetics Analysis Version 11 (MEGA 11). The phylogenetic relationships based on a sequence of the CO1 gene showed the Sardine landing fish of Tanjung Luar, East Lombok-Indonesia, can identify the CO1 gene from 13 individuals obtained from Gen-Bank, which indicates a close genetic relationship between Sardine from Tanjung Luar and Sardinella aurita. Genetic variation and population structure among 14 individuals based on molecular characteristics were analyzed using DnaSP v6. The genetic diversity value of Sardine is relatively high (average 0.97436) based on the haplotype diversity value, but it has a low population structure (F_st = 0.37488). The high genetic diversity of Sardines still makes significant stability, and populations can adapt to individual environmental conditions and cope with disturbances, diseases, and climate changes. Next, it is necessary to fully support the best strategic conservation management program for the Sardine population sustainably.

Keywords: DNA; Mitochondrial; Phylogenetic; Sardine; Sequence

Introduction

The geographical location of Indonesia’s islands on the planet connects two oceans, i.e., the Pacific and the Indian Ocean, affecting Indonesia’s waters as a place of water masses interchange with different characteristics. The Indian Ocean is warmer than the Pacific Ocean. Temperatures vary according to location and the ocean’s currents. At the same time, the water exchange occurs in the middle of the Indian Ocean and the Pacific Ocean, which causes the Indonesian Sea to be a quality habitat for various marine fishes, estimating more than 33,000 fish species alive. About 70% of them inhabit the Pacific Sea waters, which exists as the Great Sea, and more than 20% of the rest come from the Atlantic Ocean, while 8% come from the Indian Ocean with warmer waters characteristics (Vishnupriya et al., 2022). Several 4,954 species in 324 families are from the area currently known (Peristiwady, 2021). Next, over half of the total number of specimens collected present the families of Pomacentridae, Labridae, Gobiidae, Apogonidae, Chaetodontidae, Serranidae, Acanthuridae, Lutjanidae, Carangidae, and Scaridae. These families comprise about 48.9% of the total species reported. The Ichthyofauna dominates reef-associated fishes and pelagic or benthic fishes inhabiting offshore habitats.

Lombok Island, located in eastern Bali Island, links the Java Sea to the Indian Ocean and is well known as one of the dominant main water flow passages in the Indonesian Sea. Along the south coast of the Sumatra-Java and southern waters of the Lombok Island causes an increase in water mass (upwelling) in the east season (June to November), which causes the increasing inside seawater mass with low-temperature characteristics, high salinity, and rich in nutrients. This phenomenon increases the amount of chlorophyll and water fertility and then causes fish to be abundant (Eisele et al., 2021; Hendrawan & Asai, 2011).

The Genera Sardines (Sardinella) from the family Clupeidae, which come from the Mediterranean Sea, are small pelagic tropical marine fish having the principal economic value (Hunnam, 2021). Sardines, in Indonesia waters, are the third largest leading commodity after tuna and shrimp, whose producing shaws tend to increase and have been hazardous due to over-exploitation, pollution, habitat destruction, and climate change in the last ten years (Suherman et al., 2020). The other small pelagic fish, Spratelloides delicatulus, is one type of anchovy in the waters south of Lombok and likewise has experienced exploitation (Mahrus et al., 2022).

The large Sardine area in Indonesia is the Bali Strait, which is close to the Lombok strait, dominated by Sardinella lemuru Bleeker 1853 (Tinungki & Sirajang, 2019). They consist of five genera, and at least 21 unique species are ambiguous because of many similarities that are difficult to distinguish between one species and another (Thomas et al., 2014). Sardine species in Indonesia have names. i.e., Clupea (Harengula) longiceps (C.V.), S. lemuru Bleeker, 1853, Sardinella Amblygaster, Sardinella atricauda, Sardinella longiceps, Sardinella fimbriata, S. clupeoides, S. lemuru (Bali Sardinella), etc., and all species in Indonesia are called lemuru (Hunnam, 2021; Suherman et al., 2020).

The preliminary studies reported that the genetic diversity of Sardine is slightly higher than other marine fishes even though there is low genetic distance and variation between their populations, such as S. lemuru Bleeker 1853 in the Bali Strait, northern and southern-east Java (Sartimbul et al., 2018; Thomas et al., 2014). The use of mitochondrial DNA c oxidase subunit 1 (CO1) genes is a very effective method for identifying characterizing and identifying animal species quickly and accurately (Mohanty et al., 2015; Wang et al., 2018). The effectiveness of the CO1 genes is low in intraspecific variation, while interspecific variation is high, especially in adjacent taxa (Choi et al., 2020). The Fauna of Indonesia already has DNA barcodes of a few numbers on the report Center for Indonesian Institute of Sciences, consisting of mammals, birds, Komodo dragons, and insect pests (Rahayu & Jannah, 2019).

Bingpeng et al. (2018) said that the molecular technique is an alternative method for difficult-to-identify species. The number and names of Sardine species in Lombok, based on molecular characters using DNA CO1 genes or other genes until present, have not yet been reported, so it is still ambiguous taxonomy. Based on taxonomic uncertainty, this research aims to determine molecular characteristics, genetic diversity, and kinship for Sardines in Tanjung Luar fish landing center using the sequence of CO1 genes. It is located in Tanjung Luar Village, East Lombok Regency, West Nusa Tenggara Province, and has an area of approximately 222 km² with a coastline of 220 km and a sea area of 1,074.33 km². It has significant opportunities to develop fishery activities for capture fishing and culture fishery. To utilize fish resources optimally, wise management of marine and fisheries resources is needed. Therefore, this research will be crucial in elucidating.

Materials and Methods

Location of sampling and sample collection

The Sardine sampling lasted from May 10 to September 12, 2024, in the fish landing center of Tanjung Luar, caught from several locations on Lombok Island by small-scale fishermen (Fig. 1). It is located in Tanjung Luar Village, East Lombok Regency, West Nusa Tenggara Province. The distance between Tanjung Luar Village and the capital of East Lombok Regency is 18 km, and 65 km to West Nusa Tenggara Province. Tanjung Luar Village has an area of approximately 222 km² with a coastline of 220 km and a sea area of 1,074.33 km². It has quite a great potential for developing fisheries businesses for both capture fisheries and aquaculture.

Fig. 1. Sampling locations of Sardines spp. in Tanjung Luar (116°30’59”E, 8°46’39”S).

Download Original Figure

Collecting Sardines samples amounted to 100 individuals using random methods. A Sardine grouping to the genus Sardinella based on their morphological characteristics, including body shape, body size, head shape, head size, and others (Carneiro et al., 2019). Sardine samples used for molecular genetic analysis were mantle tissue (±1 cm) cut using surgical scissors. Next, put and preserve it into 5 mL sample tubes with 96% alcohol and save it at the Immunology Laboratory, Faculty of Mathematics and Natural Sciences, University of Mataram.

Morphological and molecular characteristics identification

Taxonomic identification was carried out up to the species level whenever possible. Genetic diagnosis, i.e., meristic counts and proportional measurements of collected specimens, was accomplished following (Sidiq et al., 2021). The morphological characters were observed and recorded in every sample in the premade datasheet. Determining fish species used nomenclature following the Food and Agriculture Organization (FAO) Fish Identification Sheets (Hendrikx, 2014).

There are 20 individuals for molecular characteristics identification. Fixating them previously first used 96% alcohol and saved in the Immunology Laboratory, Sciences Faculty, University of Mataram. Molecular identification in this study used the CO1 genes. They are the best solution currently identifying several fish and other fauna species because they work quickly and accurately (Bingpeng et al., 2018). Additionally, these genes can recognize species of conservative nucleotide base sequence and undergo only minor variations, deletions, and insertions (Miya et al., 2015; Phillips et al., 2019).

Extraction and amplification of c oxidase subunit 1 (CO1) gene DNA

The Sardine sample used in extraction and DNA amplification of the DNA Sardines amounted to 20 individuals from 100 Sardine samples. Extraction and amplification of Sardine DNA has the potential as a candidate DNA marker for Sardine species using a commercial kit (DNeasy Blood & Tissue Kit, Jena Bioscience, Jena, Germany) with several modifications according to tissue type or using 10% Chelex solution (Barrientos-Villalobos & Schmitter-Soto, 2019) at 95°C. DNA was extracted from a small piece of ethanol-preserved muscle tissue using a slightly modified method according to the standard DNA method for fish (Hellberg et al., 2014). The amplification of CO1 rRNA target genes utilized the universal primer pair, ie. Primer Forward 5’-ATCTTTGGTGCATGAGCAGGAATAGT-3’, and followed the Primer Rivers: 5’-ACTTCAGGGTGACCGAAGAATCAGAA-3’ (Nuryanto et al., 2019; Ward et al., 2005).

In this study, the amplicon is approximately 680 bp from the 5’ region of the mitochondrial COI gene. Polymerase chain reaction (PCR) amplification conditions are as follows: denaturation for 5 min in 35 cycles at temperatures of 95°C (30 s), 50°C (30 s), and 72°C (50 s). The PCR reaction used a total volume of 50 μL containing one μL of template DNA, 10 mm Tris-HCl (pH 9), 50 mm KCl, two mm MgCl₂, 0.2 μm of each primer, and 0.2 mm of each dNTP, and one U Taq polymerase. PCR products migrated using a 1.2% agarose gel visualized under UV light and documented using gel photographs. Then, PCR products through PT Genetica Science Indonesia for sequencing. The results of DNA sequencing of the CO1 genes were sent to researchers in file AB1 through email. Next, conduct a match between the electropherogram and the DNA sequence obtained in this study. The sequences obtained from a pair of CO1 primers were used as forward and reverse primers, each with reverse complementation to be the reverse. The next step is to align the two sequences using the W cluster menu (Katoh et al., 2019).

Interpretation of c oxidase subunit 1 (CO1) gene sequence data

The CO1 gene sequence data was analyzed using MEGA 11 Software (https://www.megasoftware.net/), edited, and aligned using Clustal W to see the diversity of nucleotide bases (Tamura et al., 2021). The analyses of sequences used references of various species belonging to the Clupeidae family by the National Center for Biotechnology Information (NCBI) GenBank. Sequence alignments are also subjected to a basic local alignment search tool (BLAST) nucleotide search to determine their identity.

CO1 gene sequence data for all Sardine species available in GenBank were included in subsequent analyses using MEGA 11 Software (Tamura et al., 2021). This study elucidates the consensus sequences in each specimen using the Staden Package. Next, sequence differences within and between species were analyzed and tested using bootstrap tests with 1,000 replications.

Genetic variation analyses

Median-joining haplotype networks of both COI gene sequences (688 bp) used DnaSP v6 for specimens of Sardine (Rozas et al., 2017). The use of analysis of molecular variance (AMOVA) to test genetic differentiation among several groups based on molecular characteristics (Doğan & Doğan, 2016).

Phylogenetic analyses

COI gene sequences were aligned, and the optimal substitution model used MEGA11 (Tamura et al., 2021). Phylogenetic trees were constructed based on the concatenated data of the two gene sequences using neighbor-joining (NJ) tree methods with 1,000 bootstrap replicates. Genetic distance matrix calculations used the Kimura-2 parameter model implemented in pairwise distance calculation in MEGA 11 Software.

Results

Morphological and molecular characteristics of Sardines

Sardines are a species of Actinopterygii fish from the Clupeidae family. The morphological form of the Sardine (Fig. 2) has characterization, i.e., its slender or elliptical body with a greenish-blue color on its back and a silvery underside. The body length can reach up to 23 cm, with an average length of 20 cm. The abdomen is slightly rounded, and the spines are blunt and not prominent. From the top of the gill cover to the base of the tail, there is a row of 10–20 black dots. The fins are yellowish-gray, but the tail fin is blackish, and the tip of the snout is also blackish.

Fig. 2. Sardine from the fish landing center-tanjung luar-east Lombok of Indonesia.

Download Original Figure

DNA sample isolated from the tissue of Sardine using a commercial kit (DNeasy Blood & Tissue Kit, Qiagen Cat. No. 69504, Jena Bioscience) successfully indicated by the qualitative test results of 1% agarose gel electrophoresis. The amplification of COI gene fragments from Sardines using a pair of universal COI primers, LCO-1490 and HCO-2198, successfully got amplicon lengths of around 688 bp (Fig. 3).

Fig. 3. The representation of polymerase chain reaction (PCR) products of the c oxidase subunit 1 (CO1) gene from the Sardines population at fish landing center of tanjung luar, Indonesia.

Download Original Figure

The results of tracking the CO1 gene sequence for 13 Sardine species in the NCBI and barcode of life data systems (BOLD) databases show that not all the BLAST sequences have the same length as the sample sequences (Table 1). BLAST aims to find regions of local similarity between sequences.

Table 1. Basic local alignment search tool (BLAST) analysis of 13 species of mitochondrial DNA c oxidase subunit 1 (COI) genes from genera Sardinella, Family Clupeidae obtained from the website of the National Center for Biotechnology Information (NCBI)

No	Accession number	Species name	Sequence length (bp)	Authors	Country
1	HQ231357.1	Sardinella fimbriata	650	Quilang et al. (2010)	Philippine
2	OK445670.1	Sardinella albella	676	Afrand et al. (2021)	Iran
3	OQ387818.	Amblygaster sirm	655	Bemis et al. (2023)	USA
4	MT272807.1	Sardinella maderensis	645	Afriyie (2020)	Sinegal
5	MN392930.1	Sardinella gibbosa	675	Bhaskar (2019)	India
6	MN586892.1	Sardinella sindensis	570	Kumar et al (2019)	India
7	MT293977.1	Sardinella lemuru	634	Labrador et al. (2021)	Philippine
8	KP001108.1	Sardinella longiceps	549	Sukumaran et al. (2014)	India
9	JQ365537.1	Sardinella sp.	652	Ribeiro et al. (2012)	Brazil
10	AM911174.1	Sardinella janeiro	594	Jerome & Martinsohn (2008)	France
11	MK585640.1	Sardinella hualiensis	636	Chan et al. (2019)	Philippine
12	MH141144.1	Sardina pilchardus	641	Ferentis et al. (2018)	Greece
13	OQ459661.1	Sardinella aurita	436	Uyan et al. (2023)	Mediterranean
14	LBK2024	Sardinella spp.*	688	Present Study	Indonesia

The asterisk denotes a fish sample from the fish landing center-tanjung luar-east Lombok of Indonesia.

Download Excel Table

Nucleotide composition

The nucleotide composition found in the COI fragment of Sardine from Lombok Strait is T = 29.0%, C = 31.1%, A (20.9%), and G (19.0%), with the nucleotide composition A + T = 49.9% and G + C= 50.1% (Table 2). Fragment length of the COI of Sardines ranged from 436 to 688 bp.

Table 2. Nucleotide composition of c oxidase subunit 1 (CO1) genes fragment of 4 family Clupeidae under Kimura 2-parameter model

No	Accession no	Species name	T(U)	C	A	G	A + T	G + C
1	HQ231357.1	Sardinella fimbriata	28.5	29.2	22.1	20.2	50.6	49.4
2	OK445670.1	Sardinella albella	28.5	29.5	21.1	20.9	49.6	50.4
3	OQ387818.1	Amblygaster sirm	27.6	31.1	22.8	18.5	50.4	49.6
4	MT272807.1	Sardinella maderensis	27.8	31.1	22.1	19.0	49.9	50.1
5	MN392930.1	Sardinella gibbosa	26.1	31.8	22.3	19.7	48.4	51.5
6	MN586892.1	Sardinella sindensis	26.1	31.8	22.3	19.7	48.4	51.5
7	MT293977.1	Sardinella lemuru	29.2	30.9	21.1	18.8	50.3	49.7
8	KP001108.1	Sardinella longiceps	28.7	31.1	20.9	19.2	49.6	50.3
9	JQ365537.1	Sardinella sp.	29.7	30.4	20.7	19.2	50.4	49.6
10	AM911174.1	Sardinella janeiro	29.7	30.4	20.7	19.2	50.4	49.6
11	MK585640.1	Sardinella hualiensis	27.3	31.8	19.5	21.4	46.8	53.2
12	MH141144.1	Sardina pilchardus	29.0	29.7	20.2	21.1	49.2	50.8
13	OQ459661.1	Sardinella aurita	29.5	31.4	20.2	19.0	49.7	50.4
14	LBK2024	Sardinella spp.*	29.0	31.1	20.9	19.0	49.9	50.1
		Average	28.3	30.8	21.2	19.6	49.5	50.4

The asterisk denotes a fish sample from the fish landing center-tanjung luar-east Lombok of Indonesia.

Download Excel Table

Data for comparison of nucleotide composition of 14 individuals of Sardine based on CO1 gene sequences is presented in Fig. 4 and is homogeneous.

Fig. 4. Nucleotide composition of 14 individuals of Sardine based on c oxidase subunit 1 (CO1) gene sequences.

Download Original Figure

Genetic distance

The genetic distance ranges from 0.00 to 0.33 in CO1 fragments of 14 individual Sardines (Table 3).

Table 3. Estimates of pairwise genetic distances and standard error of 14, family Clupeidae under Kimura 2-parameter model in c oxidase subunit 1 (CO1) genes

No	1	2	3	4	5	6	7	8	9	10	11	12	13
1
2	0.26
3	0.26	0.01
4	0.22	0.19	0.20
5	0.34	0.28	0.27	0.28
6	0.26	0.27	0.25	0.23	0.37
7	0.23	0.19	0.18	0.13	0.28	0.22
8	0.34	0.24	0.23	0.24	0.29	0.27	0.20
9	0.22	0.19	0.20	0.00	0.28	0.23	0.13	0.24
10	0.31	0.03	0.28	0.29	0.32	0.29	0.27	0.30	0.29
11	0.26	0.02	0.01	0.20	0.30	0.26	0.16	0.24	0.20	0.27
12	0.26	0.02	0.01	0.20	0.30	0.26	0.16	0.24	0.20	0.27	0.00
13	0.28	0.26	0.24	0.21	0.32	0.06	0.21	0.25	0.21	0.32	0.25	0.25
14	0.33	0.27	0.26	0.29	0.28	0.29	0.31	0.31	0.29	0.29	0.28	0.28	0.30

Download Excel Table

Phylogenetic reconstructions

The maximum likelihood (ML) tree revealed distinct clades separated based on the genus of Sardine, Family Clupeidae (Fig. 5).

Fig. 5. Maximum-likelihood (ML) phylogenetic of 14 Sardines c oxidase subunit 1 (CO1) genes sequence using the Kimura-2 model, bootstrap value 10,000 replications. *The asterisk denotes fish sample from the fish landing center-tanjung luar-east Lombok of Indonesia.

Download Original Figure

Discussion

Sardines from the fish landing center-tanjung luar-east Lombok

The morphological characteristics of Sardines are generally found at fish landing center-tanjung luar-east Lombok, similar to Sardines in Indonesia waters, and often consist of S. lemuru, S. atricauda, S. longiceps, S. fimbriata, S. clupeoides, and S. leiogaster. Aside from the Sardine names, other names of Sardines from different genera are Amblygaster sirm, Herklotsichthys quadrimaculatus, Herklotsichthys dispilonotus, and Escualosa thoracata (Hunnam, 2021).

Extraction and amplification of c oxidase subunit 1 (COI ) gene fragments

The qualitative Mitochondrial DNA isolation test in Fig. 3 showed a thin band on the agarose gel 1%. This success is similar to the isolation results in the Mitochondrial DNA of marine fish tissues, as reported in previous studies by Mascolo et al. (2019). Amplicons of 688 bp can be polymorphisms causes the possibility of intraspecies occurrence can be polymorphisms because of the possibility of intraspecies occurrence using universal primers (Yang et al., 2014). The CO1 genes have the advantage that their universal primers are very robust, so they can recognize the 5’ end of most animal groups, although not all, but represent most of the phylum Animalia (Ward et al., 2005).

The results of sequence homology comparisons with the NCBI and BOLD databases show that Sardinella spp., research samples from Lombok, have sequences that are similar to the COI gene fragment of the Sardinella aurita. The level of similarity between the two species ranges from 95.00% to 99.00% (NCBI) and 95.0–100.00% in the BOLD system. Accuracy and reliability assessments of sequences generated from the BOLD structure outperformed GenBank in terms of performance for Sardinella spp. (100% and 99%), respectively. From this data, sequence homology results with the NCBI database and the BOLD system show that the research samples from Lombok have sequences that are similar to the COI gene fragment of the S. delicatulus species with very high similarity (99%). The DNA sequence of 688 base pairs encodes 229 amino acids, with the amplified regions located at nucleotide positions.

Nucleotide composition

The sizeable A + T nucleotide composition in Table 2 is in Sardinella hualiensis (46.8%) for the CO1 DNA fragment. In contrast, the great G + C nucleotide composition (53.2%). The mean nucleotide composition among the sequences of 14 Sardinella spp. individuals were estimated as T = 28.3 ± 1.1%, C = 30.8 ± 0.8%, A = 21.2 ± 0.9%, and G = 19.6 ± 0.8%. The content means AT is 49.5 ± 1.0%, and GC is 50.4 ± 1.9%. Nucleotide composition can support fish identification because differences in nucleotide composition indicate variations in nucleotide sites. These varied sites become specific characteristics that can differentiate fish species. According to Kombong & Arisuryanti (2018), differences in the nucleotide composition of the mitochondrial COI indicate genetic variation.

Genomes of fish and amphibians are GC homogenous, whereas birds and mammals are AT/GC heterogeneous, and the exact reason for this phenomenon remains controversial. Data GC/AT on the CO1 gene fragment is homogeneous, as in Fig. 4. The results are supported and compatible with previous research on teleost fish genomes (Symonová & Suh, 2019).

The average percentage of primary composition of CO1 gene fragments in various Sardine species does not match research on Indian freshwater fish with an amplicon of 700 bp and the average nucleotide composition is A: 23.2%; T:29.3%, G:18.9%; C: 28.6%, while in this study A: 21.2, T: 28.3, G: 19.6, C: 30.8% (Lakra et al., 2016). Likewise, the results on Mugil cephalus with rich and poor control areas at G content = 17.5% (Jamandre et al., 2014).

Genetic distance

The calculation of genetic distances using the Kimura-2-parameter method integrated into MEGA 11 is the matrix of distances between base pairs between sequences that approach evolutionary distances. The highest intergeneric divergence (0.33), as presented in Table 3, was between Sardinella spp. and S. fimbriata. In contrast, the highest intrageneric divergence (0.30) is available on Sardinella spp. and S. aurita. S. fimbriata generally showed high sequence divergence from other Sardine species (0.22–0.33) studied here. The lowest intergeneric sequence divergence was between S. hualiensis, Sardina pilchardus, and A. sirm (0.00). In contrast, the great size intrageneric distance is Sardinella spp. with S. fimbriata, S. lemuru, and S. aurita (0.31).

Genetic distance is a measure of the transformation divergence between copies of homologous genes which share a common ancestor (Doğan & Doğan, 2016). Nei’s genetic distance value ranges from 0.0005 to 0.1006. The genetic distance value of 0.010–0.099 is in the low category. The genetic distance analysis found that the lowest genetic distance (0.00) occurred in S. hualiensis and Sardina pilchardus. Maximum and minimum genetic distance was recorded as 0.33 to 0.00 in the Sardine family Clupeidae, called the high distance category. This study is similar to research reported that average heterozygosity ranged from 0.3009 to 0.3744 (Mahboob et al., 2019). This information on genetic polymorphism is helpful for the concerned authorities evolving strategies to conserve the diversity of tilapia in the country.

Phylogenetic reconstructions

The analysis of the ML method using the Kimura-2 model showed the kinship between species based on the length of the branch lines. Different line lengths indicate the level of evolution of each species. Longer lines indicate further evolutionary distances, while shorter lines indicate closer evolutionary distances of a species. All Sardine species in this phylogenetic tree form clades and one individual (A. sirm) is outside the group (Fig. 5), while in the NJ, A. sirm is inside the group. In contrast, the unweighted pair group method with arithmetic places A. sirm and Sardina pilchardus outside the group, while the ML method places A. sirm alone outside the Sardine group, the estimation methods of ML is the best model because it has the minimum evolutionary change compared to the UPGAMA, dan NJ methods.

The results of nucleotide alignment along the 688 bp mitochondrial DNA COI gene fragment support this identification result, as presented in Fig. 5. Most of these sequences matched to reference sequences on GenBank or BOLD databases with more than 95% identity. The overall mean distance of the Sardine sequences was 23%. Analysis based on the phylogenetic tree shows that all Sardine members of the family Clupeidae are monophyletic and grouped into three Clads, namely Clad 1 consists of S. fimbriata, Sardinella albella, Sardinella hualensis, Sardinella maderensis, S. gibbose, and Sardinella sindensis; Clad 2: S. lemuru, S. longiceps, Sardinella janeiro, and Sardinella sp.; Clad 3: Sardina pilchardus, S. aurita, and Sardinella spp*., while A. sirm is out of the group. The reconstruction of the phylogenetic tree in Fig. 5 confirms that Sardines from the fish landing center of Tanjung Luar-East Lombok and S. aurita are in the same clade as S. aurita and have a very close relationship. The same results in preliminary research in Lombok Strait, but using a different gene fragment, the 12S gene, found that Sardine (S. lemuru) and S. aurita have a very close relationship (Mahrus et al., 2012).

A phonetic ML tree using K2P distances illustrated CO1-based genetic divergence among intra and inter-specific hierarchical units. There is grouping on All species. Thus, the COI gene correctly identified all species. Bootstrap analysis consensus trees with 10,000 replicates obtained from ML and MP showed similar patterns in the genetic relationship among the Sardine species studied. There was no haplotype sharing or overlapping. An average haplotype diversity (Hd) using DNAsp average of 0.9743. They mean the genetic diversity of Sardine is high. However, the population structure is low (F_st = 0.37488). The phylogenetic trees using the haplotypes for COI genes could unambiguously differentiate all the 14 species of wild in marine waters. Population structure plays the role of analyzing gene expression in the population (Fachrul et al., 2023). The analysis of molecular data is needed to recognize the extensive phylogenetic relation, investigate population composition inside a species, and resolve taxonomic ambiguities, especially in Sardine species, and has proved to be principal (Devanand et al., 2020). It is suggested that CO1 genes, as DNA barcoding, was a build-on tool to taxonomists, besides conventional taxonomy, could help to find a solution to the taxonomic ambiguities of Sardine taxonomy especially.

The results of this study, which has a low population structure with high genetic diversity, are similar to the results of research on an economically important fish species (Lutjanus jocu) reported by Verba et al. (2023). Preliminary research also reported similar cases of low population structure and genetic diversity in Rhinolophus blasii in Europe (Jakab et al., 2021). Factors that influence low population structure consist of the availability of low amounts of food, predators, competition with living creatures of the same species or other species, climate, changes in habitat, and disease (Verba et al., 2023). Next, differences in depth and color in these fish are not significant enough to be found as an indication of new species. In addition, the distribution of larvae, juvenile movement, and migration of adult fish that are actively moving can cause a species or population in different areas to have genetic connections or connectivity.

Using molecular DNA barcoding can identify organisms based on short and explicit DNA sequences, one of which is the CO1 gene (Ward et al., 2005). Mitochondrial DNA of the CO1 gene is one of the markers to determine the population. Mitochondrial DNA markers can determine molecular diversity in animals because they are easy to explain and produce population connectivity in a short time. Population connectivity also affects the adaptation of a species. The information related to population connectivity can be a guide in carrying out conservation and recovery efforts of a population because it has a direct or indirect impact on a population’s adaptation to its environment. Verba et al. (2023) reported that low population structure is affected by factors as well as restricted nourishment availableness, hunters, rivalry with other species or taxonomic groups, climate, natural environment changes, and illness. Unfortunately, the factors have not been investigated yet in the present, so they will be a particular concern in the future. Finally, it hopefully supports management activities, determination of conservation areas, and sustainable strategies to maintain and preserve Sardine fish resources in Indonesia.

Conclusion

COI gene fragments successfully amplified Sardines with an amplicon length of around 688 bp from fish landing center-Tanjung Luar, East Lombok. The study indicates that the mitochondrial DNA COI gene sequence can identify wild Sardine species in high levels of genetic divergence (0.00–0.34). Data on Hd using DNAsp: 0.97436 means the genetic diversity of Sardine is high. However, the population structure is low (F_st = 0.37488). Besides that, the COI gene sequence can be acceptable as an authentic identification marker for various marine fish species.

Competing interests

No potential conflict of interest relevant to this article was reported.

Funding sources

The Research was backed by the Ministry of Education, Research, Culture, and Technology of the Indonesia Republic via the incentive funding of the Fundamental Research Program in 2024 with grant number 060/E5/PG.02.00.PL/2024.

Acknowledgements

The researchers would like to thank the Ministry of Education, Research, Culture, and Technology of the Indonesia Republic for supporting the funding of this research through the incentive funding of the Fundamental Research program in 2024. The Researchers are also grateful to the Immunology Laboratory assistant, Sitti Rosida, who has helped a lot in laboratory activities such as isolation, extraction, and PCR to get DNA CO1 genes.

Availability of data and materials

Upon reasonable request, the datasets of this study can be available from the corresponding author.

Ethics approval and consent to participate

This study conformed to the guidance of animal ethical treatment for the care and use of experimental animals.

References

Barrientos-Villalobos J, Schmitter-Soto JJ. Phylogeography of the Mayan cichlid Mayaheros urophthalmus (Teleostei: Cichlidae) in the Yucatan peninsula based on mitochondrial markers CYTB and COI. Environ Biol Fish. 2019; 102:1461-72

Bingpeng X, Heshan L, Zhilan Z, Chunguang W, Yanguo W, Jianjun W. DNA barcoding for identification of fish species in the Taiwan Strait. PLOS ONE. 2018; 13e0198109

Carneiro M, Martins R, Reiner F, Batista I. Ichthyofauna of Portugal: taxonomic diversity, common and scientific names of marine fishes. Lisbon: Portuguese Institute for Sea and Atmosphere (IPMA, I.P.). 2019.

Choi EH, Kim G, Cha SH, Lee JS, Ryu SH, Suk HY, et al. Molecular phylogenetic, population genetic and demographic studies of Nodularia douglasiae and Nodularia breviconcha based on CO1 and 16S rRNA. Sci Rep. 2020; 10:16572

Devanand TN, Anjanayappa HN, Kumar G, Naveen Kumar BT, Gopan A, Panda K. A comparative study of COI and 16S rRNA genes for DNA barcoding of Indian marine clupeids. J Exp Zool India. 2020; 23:1439-44.

Doğan İ, Doğan N. Genetic distance measures: review. Turkiye Klinikleri J Biostat. 2016; 8:87-93

Eisele MH, Madrigal-Mora S, Espinoza M. Drivers of reef fish assemblages in an upwelling region from the Eastern ropical Pacific ocean. J Fish Biol. 2021; 98:1074-90

Fachrul M, Karkey A, Shakya M, Judd LM, Harshegyi T, Sim KS, et al. Direct inference and control of genetic population structure from RNA sequencing data. Commun Biol. 2023; 6:804

Hellberg RS, Kawalek MD, Van KT, Shen Y, Williams-Hill DM. Comparison of DNA extraction and PCR setup methods for use in high-throughput DNA barcoding of fish species. Food Anal Methods. 2014; 7:1950-9

10.

Hendrawan IG, Asai K. Numerical study of tidal upwelling over the sill in the Lombok Strait (Indonesia).In In: Proceedings of the Twenty-first International Offshore and Polar Engineering Conference 2011, Maui, HI

11.

Hendrikx S. 3 Identification of Herring species (Clupeidae) in Conrad Gessner’s ichthyological works: a case study on taxonomy, nomenclature, and animal depiction in the sixteenth century.In In: Enenkel KAE, Smith PJ, editors.editors Zoology in early modern culture: intersections of science, theology, philology, and political and religious education. Leiden: Brill. 2014; p p. 149-71

12.

Hunnam K. The biology and ecology of tropical marine Sardines and herrings in Indo-West Pacific fisheries: a review. Rev Fish Biol Fish. 2021; 31:449-84

13.

Jakab E, Bücs S, Jére C, Csösz I, Jakab RI, Szodoray-Parádi F, et al. Low population structure and genetic diversity in Rhinolophus blasii at the northern limit of its European range: are there undiscovered colonies?. Acta Chiropt. 2021; 23:301-11

14.

Jamandre BW, Durand JD, Tzeng WN. High sequence variations in mitochondrial DNA control region among worldwide populations of flathead mullet Mugil cephalus. Int J Zool. 2014; 2014:564105

15.

Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019; 20:1160-6

16.

Kombong CBS, Arisuryanti T. The 16S and COI mitochondrial DNA nucleotide composition of stripped snakehead (Channa striata Bloch, 1793) from Lake Sentani, Jayapura, Papua. J Perikanan Univ Gadjah Mada. 2018; 20:57-62

17.

Lakra WS, Singh M, Goswami M, Gopalakrishnan A, Lal KK, Mohindra V, et al. DNA barcoding Indian freshwater fishes. Mitochondrial DNA A. 2016; 27:4510-7

18.

Mahboob S, Al-Ghanim KA, Al-Misned F, Al-Balawi HFA, Ashraf A, Al-Mulhim NMA. Genetic diversity in tilapia populations in a freshwater reservoir assayed by randomly amplified polymorphic DNA markers. Saudi J Biol Sci. 2019; 26:363-7

19.

Mahrus H, Idrus AA, Zulkifli L. Molecular phylogeny of anchovy (Clupeiformes: Clupeidae) from southern waters of Lombok using mitochondrial DNA CO1 gene sequences. Biodivers J Biol Divers. 2022; 23:2433-43

20.

Mahrus, Sumitro SB, Utomo DH, Sartimbul A, Toha AH, Widodo N. Genetic relationship of Sardinella lemuru from Lombok Strait with fish rich in omega-3 fatty acid. Bioinformation. 2012; 8:1271-6

21.

Mascolo C, Ceruso M, Sordino P, Palma G, Anastasio A, Pepe T. Comparison of mitochondrial DNA enrichment and sequencing methods from fish tissue. Food Chem. 2019; 294:333-8

22.

Miya S, Takahashi H, Nakagawa M, Kuda T, Igimi S, Kimura B. Genetic characteristics of Japanese clinical Listeria monocytogenes isolates. PLOS ONE. 2015; 10e0122902

23.

Mohanty M, Jayasankar P, Sahoo L, Das P. A comparative study of COI and 16 S rRNA genes for DNA barcoding of cultivable carps in India. Mitochondrial DNA. 2015; 26:79-87

24.

Nuryanto A, Dewi RE, Pramono H. Genetic homogeneity of Commerson’s Anchovy (Stolephorus commersonnii) in Segara Anakan Cilacap Central Java inferred from PCR-RFLP Markers. Biogenesis. 2019; 7:14-23

25.

Peristiwady T. Ichthyological research and biodiversity of marine fishes in Indonesia. IOP Conf Ser Earth Environ Sci. 2021; 789:012009

26.

Phillips JD, Gillis DJ, Hanner RH. Incomplete estimates of genetic diversity within species: implications for DNA barcoding. Ecol Evol. 2019; 9:2996-3010

27.

Rahayu DA, Jannah M. DNA barcode hewan dan tumbuhan Indonesia. Jakarta: Yayasan Inspirasi Ide Berdaya (INSIDE). 2019.

28.

Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Evol. 2017; 34:3299-302

29.

Sartimbul A, Rohadi E, Ikhsani SN, Listiyaningsih D. Morphometric and meristic variations among five populations of Sardinella lemuru Bleeker, 1853 from waters of Bali Strait, northern and southern-east Java and their relation to the environment. AACL Bioflux. 2018; 11:744-52.

30.

Sidiq M, Ahmed I, Bakhtiyar Y. Length-weight relationship, morphometric characters, and meristic counts of the coldwater fish (Heckel) from Dal Lake. Fish Aquat Life. 2021; 29:29-34

31.

Suherman AS, Boesono H, Kurohman F, Muzakir AK. Kinerja pelabuhan perikanan nusantara (PPN) pengambengan Jembrana Bali (performance of pengambengan nusantara fishing port (Nfp) Jembrana-Bali). Saintek Perikanan IJoST. 2020; 16:123-31

32.

Symonová R, Suh A. Nucleotide composition of transposable elements likely contributes to AT/GC compositional homogeneity of teleost fish genomes. Mob DNA. 2019; 10:49

33.

Tamura K, Stecher G, Kumar S. MEGA11: molecular evolutionary genetics analysis version 11. Mol Biol Evol. 2021; 38:3022-7

34.

Thomas RC, Willette DA, Carpenter KE, Santos MD. Hidden diversity in Sardines: genetic and morphological evidence for cryptic species in the goldstripe Sardinella, Sardinella gibbosa (Bleeker, 1849). PLOS ONE. 2014; 9e84719

35.

Tinungki GM, Sirajang N. The dynamic system analysis of lemuru fishery in Bali Strait by using biological parameter yield of some surplus production models. IOP Conf Ser Earth Environ Sci. 2019; 279:012054

36.

Vishnupriya KM, Nair RJ, Sangeetha AT. History of ichthyology.In ICAR-CMFRI: winter school on recent development in taxonomic techniques of marine fishes for conservation and sustainable fisheries management. In: Sagar MV, Nair RJ, Sa T, Sagar MV, editors.editors Kochi: ICAR-Central Marine Fisheries Research Institute. 2022; p p. 12-22.

37.

Wang ZL, Yang XQ, Wang TZ, Yu X. Assessing the effectiveness of mitochondrial COI and 16S rRNA genes for DNA barcoding of farmland spiders in China. Mitochondrial DNA A. 2018; 29:695-702

38.

Ward RD, Zemlak TS, Innes BH, Last PR, Hebert PDN. DNA barcoding Australia’s fish species. Philos Trans R Soc Lond B Biol Sci. 2005; 360:1847-57

39.

Verba JT, Stow A, Bein B, Pennino MG, Lopes PFM, Ferreira BP, et al. Low population genetic structure is consistent with high habitat connectivity in a commercially important fish species (Lutjanus jocu). Mar Biol. 2023; 170:1-15

40.

Yang L, Tan Z, Wang D, Xue L, Guan M, Huang T, et al. Species identification through mitochondrial rRNA genetic analysis. Sci Rep. 2014; 4:4089