Persistent infection with hepatitis C virus (HCV) causes cirrhosis and hepatocellular carcinoma. It is estimated that 1 million people in Japan[1] and 58 million people worldwide have chronic HCV infection, and 0.3 million people die annually from hepatitis C globally, mostly from cirrhosis and hepatocellular carcinoma (
Mutations in the viral gene are involved in the cause of DAA-resistant virus,[6] disease progression,[7] and hepatocarcinogenesis[8] as well as host factors, and analysis of the viral gene is essential for its elucidation.[9] Until now, the HCV gene has been analyzed by traditional short-read Sanger or deep sequencing.[10] The presence of signature mutations was clinically found regarding resistance mutations to DAA,[11] and these mutations were shown to cause a decrease in drug sensitivity the cell-culture system and the replicon system.[12,13] However, these signature mutations have so far been analyzed separately only in part of the HCV genome, and it is not clear how the haplotype, which is a combination of the entire structure or various mutations, affects the disease state and therapeutic effect.[14] This is because these sequencing technologies have been able to sequence only short molecules up to several hundred base pairs (bp) and cannot discriminate whether detected distant mutations are on a single viral genome molecule or on different ones.[15]
Recently, a long-read single-molecule sequence, which is a third-generation sequencing technology, has been developed and can be applied to the analysis of viral genes.[16] These techniques can analyze the nucleotide sequence of a single molecule in several tens of kilobases (kb) and can analyze the large-scale structure of a gene. In this study, we revealed the change in combinations of HCV mutations (haplotype) on single HCV molecules in DAA treatment failure, using the Oxford Nanopore sequencer, which is a third-generation sequencing technology.[17,18] Using third-generation sequencing, long read is possible, but the read accuracy is low at about 90%. To overcome this problem, we used rolling circle amplification (RCA)-Nanopore sequencing in which the circularized polymerase chain reaction (PCR) amplification product of the HCV genome was amplified by RCA to produce long tandem repeats (concatemer) of the original sequence. These long tandem repeats of single molecules were sequenced by a Nanopore sequencer to correct sequencing errors and to obtain consensus, enabling single-molecule long-read sequencing.[19]
PATIENTS AND METHODS PatientsTwelve patients with chronic HCV genotype 1b infection and who were unsuccessfully treated with asunaprevir (ASV)/daclatasvir (DCV) treatment (60 mg DCV once/day and 200 mg ASV twice/day for 24 weeks of the standard protocol) for chronic hepatitis C in 2015 at the University of Yamanashi Hospital were enrolled in this study. Serum samples were collected from all patients during treatment and store at −20°C before analysis. HCV genome sequences in the serum of each patient were analyzed at the start of ASV/DCV treatment, at treatment failure, and at the end of follow-up. The clinical course of each patient, including the time points of sample collection and viral titers, are shown in Table 1. The study was reviewed and approved by the Institutional Review Board and performed with the necessary patient consent. This study protocol complied with all provisions of the Declaration of Helsinki.
TABLE 1 Course of ASV/DCV therapy and sampling time points for sequencing analyses
Abbreviations: AE, adverse event; NVR, null viral response; Tx, treatment.
aSampling time points were different between direct and Nanopore sequencing in some cases.
bNVR (HCV RNA not becoming undetectable during the treatment course).
RNA extraction and reverse transcriptionTotal RNA was extracted in 30 μL of RNA solution from 200 µL of serum by using the QIAmp Mini Elute Virus Spin Kit (Qiagen, Germany). To synthesize complementary DNA (cDNA) as described,[20] 7 μL of RNA solution was reacted at 49°C for 65 min (followed by 85°C for 5 min) in 20 μL of reverse-transcription (RT) solution of SuperScript III reverse transcriptase (Thermo Fisher Scientific, Waltham, MA), RV1 primer[21] or a pangenomic primer Vir7 (oligo-2′-deoxyadenosine [oligo dA20]),[20] deoxyribonucleotide triphosphates (dNTPs), and betain (Sigma Aldrich). Two units of ribonuclease H were then added before a final incubation at 37°C for 20 min.
PCR reactionApproximately 3 kb of the sequence spanning the NS3 and NS5A regions of the HCV genome were PCR amplified using PrimeSTAR GXL (Takara, Japan). PCR primers designed based on a published report[21] are listed in Table 2. The first PCR was performed with cDNA in 2.5 μL of RT solution (equivalent to 5.83 μL of serum) using the first-round sense and antisense primers, dNTPs, and betaine. The cycling condition was preincubated at 98°C for 2 min, 40 cycles at 98°C for 10 seconds, 60°C for 15 seconds, and 68°C for 3 min, with a final extension for 3 min. The nested PCR was performed with the first PCR product as described for the first PCR, using the second-round sense and antisense primers. If needed, the second-round sense primers with barcode sequences of 24 nucleotides (following the Nanopore PCR Barcoding Kit; Oxford Nanopore Technologies, Oxford, United Kingdom) attached at the 5′ end were used so that the PCR products from each sample were identifiable after pooled Nanopore sequencing. After extraction of the target PCR products from agarose gel electrophoresis using the Monarch DNA Gel Extraction Kit (New England Biolabs, Ipswich, MA), PCR products were quantified using a pico green double-stranded DNA assay kit (Invitrogen, Japan), the concentration of each sample was adjusted for standardization, and pooled samples were prepared for further analyses.
TABLE 2 Primers for HCV genome PCR amplification
Primer Name | Sequence (5′-3′)a | Binding Positionb | Case No. | |||||
All (Except Right) | 1 | 2 | 4 | 5 | 8 | |||
Sense Primer | ||||||||
FW3[21] | ACAGGTCGGGACAAGAACCAG | 3483–3503 | S-1, F-1, E-1 | F-1 | E-1, R-1 | S-1, E-1 | S-1, F-1, E-1 | S-1, E-1 |
FW4[21] | ACAAGAACCAGGTCGATGGGGAG | 3493–3515 | S-2, F-2, E-2 | E-2 | E-2 | S-2, E-2 | S-2 | |
FW3new | CACAGGCCGGGACAAGAACCA | 3482–3502 | F-1 | F-1 | S-1 | F-1 | E-1 | |
FW4new | GTCGAGGGGGAGGTTCAAGTGGT | 3504–3526 | F-2 | F-2 | S-2 | F-2 | E-2 | |
FW5 | GTYCARGTGGTTTCYACCGCWACRCA | 3516–3541 | S-1, E-1 | S-1 | F-1 | |||
FW6 | TCYTTCYTGGCGACCTGYRTCAAYGG | 3543–3568 | S-2, E-2 | S-2 | F-2 | |||
FWout | CTTGGTTGCATCATCACTAGCCTCACAGG | 3459–3487 | F-1 | |||||
FWin | ATCCCCCACCCGCGTAACCTCCACGTACTC | 3480–3508 | F-2 | |||||
Antisense primer | ||||||||
RV1[21] | CCCGTCACGTAGTGGAAATC | 6633–6652 | S-1, F-1, E-1 | F-1 | E-1, R-1 | S-1, E-1 | S-1, F-1, E-1 | S-1, E-1 |
RV2[21] | GCGTAACCTCCACGTACTCC | 6605–6624 | S-2, F-2, E-2 | E-2 | E-2 | S-2, E-2 | S-2 | |
RV2new | CGCGTAACCTCCACGTACTC | 6606–6625 | F-2 | F-2 | S-2 | F-2 | E-2 | |
RV5 | CTCAGCRGCYACCCGCCACAGCGCC | 6581–6605 | S-1, E-1 | S-1 | F-1 | |||
RV6 | TGGARTARTTTGGMGCYGGGGAGGG | 6555–6579 | S-2, E-2 | S-2 | F-2 | |||
RVout | GTCAGTGGTCATGCCCGTCACGTAGTGGAA | 6636–6665 | F-1 | |||||
RVin | ATCCCCCACCCGCGTAACCTCCACGTACTC | 6606–6635 | F-2 |
Abbreviations: 1, first-round PCR; 2, second-round PCR; E, end of follow-up; F, treatment failure; S, start of treatment.
aFor degenerate primers, B = C or G or T; H = A or C or T; M = A or C; N = A or C or G or T; R = A or G; S = C or G; W = A or T; V = A or C or G; D = A or G or T; Y = C or T.
bGenome binding position with reference to HCV GT1b strain, HC-J4L6S; GenBank accession AF054247.1.
LigationCircular ligation of pooled PCR products was performed with the Quick Ligation Kit (New England Biolabs). The mixture was incubated at room temperature for 5 min. Ligated products were purified and concentrated using the Monarch PCR and DNA Cleanup Kit (New England Biolabs). Plasmid-safe adenosine triphosphate-dependent deoxyribonuclease (DNase) (Epicentre Biotechnologies, Madison, WI) digest was performed at 37°C for 1 hour to remove noncircularized products. The DNase was then heat inactivated at 70°C for 30 min.
RCA reactionThe circularized DNA template molecules were purified and concentrated by the Monarch PCR and DNA Cleanup Kit (New England Biolabs). The RCA reaction was performed using the TruePrime RCA Kit (Expedeon). Conditions for RCA were 30°C for 24 hours before heat inactivation at 65°C for 10 min. RCA products were purified with Agencourt AMPure XP beads (Beckman Coulter) and eluted in distilled water. Then NEB buffer 2 and T7 endonuclease (New England Biolabs) were added and incubated at 37°C for 90 min to resolve potential branching generated by the RCA. Debranched products were purified using AMPure XP beads and eluted. To eliminate low-molecular DNA, the Short-Read Eliminator Kit (Circulomics, Baltimore, MD) was used according to the manufacturer’s protocol.
Nanopore sequencing reactionLibraries were prepared with 1D amplicon/cDNA by ligation protocol (SQK-LSK109; Oxford Nanopore Technologies) and sequenced using R9.4.1 or R10.3 flowcells on a MinION Mk1B sequencer using MinKnow software on a personal computer or MinIT system (Oxford Nanopore Technologies) for 24 to 48 hours. Barcode sequences were appended on the templates using the Nanopore Barcode Kit (Oxford Nanopore Technologies), or the identical sequences were attached to the 5′ of the second-round sense primers for later multiplex analysis.
Bioinformatic analysesThe raw Nanopore reads of the RCA concatemers were base called using MinKnow and Guppy software (Oxford Nanopore Technologies), generating fastq files from fast5 files. We filtered the fastq reads of each sequencing run (those with a quality score <7–10 according to total read counts to avoid processing an excessive amount of sequence data, and length <30 kb) by using seqkit (
In addition to sequencing HCV from clinical samples, we also applied our sequencing methods to plasmids to investigate the specificity and sensitivity for mutation detection of our approach using a template from plasmids containing different HCV sequences. We used the HCV plasmid full-genome construct of pCV-J4L6S[23] (kindly provided by Dr. Yanagi, National Institutes of Health, Bethesda, MD), and pORN/C-5B[24] (kindly provided by Dr. Ikeda, Okayama University, Okayama, Japan) as the control template for PCR, RCA, and Nanopore sequencing, along with the representative clinical samples obtained from patients with HCV infection and liver cirrhosis (LC). To evaluate the threshold values at which fewer sequences can be detected, PCR products from the plasmids having pORN/C-5B were mixed at 2%, 9%, 17%, and 50% ratios with those from pCV-J4L6S and were Nanopore sequenced.
Sanger sequencing and deep sequencing of signature RASTotal RNA was extracted from 150 µL of serum, using the QIAquick Viral Mini Kit (Qiagen), and cDNA was synthesized with random hexamer and SuperScript II (Thermo Fisher) and amplified by PCR separately for the NS3 and NS5A regions containing target RASs.[22,25] Direct (populational) sequencing of the PCR product for targeted signature RASs was carried out by Sanger sequencing and deep sequencing with the next-generation sequencer GS Junior System (Roche, Branford, CT) using the described protocol.[22,25]
RESULTS Outline of Nanopore sequencingThe graphical outline of the long-read single-molecule sequencing using the Nanopore sequencer is shown in Figure 1. We successfully obtained long reads more than 30 kb consisting of the concatemer of the single-molecule HCV sequence and determined their consensus sequences to correct Nanopore sequencing errors by aligning the tandem-repeated sequences amplified by RCA. The approximate 3.2-kb product amplified by RT-PCR was circularized and amplified by RCA, and the synthesis of high-molecular weight DNA was confirmed by agarose gel electrophoresis. When the RCA product was treated with a restriction enzyme (Sac I), which generates 2.8 kb and 0.4 kb fragmentes, or with Ale I, which generates 1.9-kb and 1.3-kb fragments from each repeat sequence, DNA fragments with expected sizes were detected on agarose gel electrophoresis, confirming that the circularized molecules were amplified by RCA as a concatemer (Figure 1).
By processing the PCR-RCA product derived from the HCV plasmid with the short-read eliminator before Nanopore sequencing, it was possible to efficiently remove reads of 10 kb or less. The proportion of reads over 30 kb was approximately 0.016% but increased to 1.5% after the short-read eliminator treatment (filtered reads/total reads, 126/780,000 versus 1945/130,804; filtering condition, quality score <8, length <30 kb). A snapshot of an example of a 70-kb long read that was searched, using the Basic Local Alignment Search Tool (BLAST), with the HCV sequence (
Nanopore sequencing using plasmid containing pCV-J4L6S as templates showed under a 0.1% substitution rate, indicating Nanopore sequencing with RCA achieved high accuracy (lower right graphs in Figure 1). In contrast, about a 1% substitution rate was observed in clinical samples from patients with LC, suggesting the reflection of quasispecies. It was possible to detect as low as 2% minor sequences in analysis of 100 reads in a semiquantitative manner when PCR products from different plasmids (pCV-J4L6S and pORN/C-5B) were mixed in various proportions, indicating high specificity of the present method for detecting quasispecies.
Nanopore sequencing results from clinical samplesDetails of the results of nine independent Nanopore sequencing runs (run numbers 1–9), including a total of 36 samples consisting of three time points (start of treatment, treatment failure, and end of treatment) from 12 patients (case numbers 1–12) are shown in Table 3. Each sequencing run included different combinations of patients and time points because of an arbitrary order of sample preparation. Read counts, length, and quality scores are shown before and after filtering with length >30 kb and quality >7–10 and after demultiplexing. Total counts of reads were 22,164–384,000 (depending on the applied amounts of samples and run time) and were sufficient for downstream analyses. After filtering the short or low-quality reads, 735–3802 reads were usable for demultiplexing, and the read sets from a total of 36 different samples were obtained with read counts of 121–310, length of 30.0–107.8 kb, and quality score 7.0–12.2. The sum of read counts after demultiplexing does not necessarily match with read counts after filtering because each sequencing run included samples for other experiments. From each read set after filtering and demultiplexing, the longest 100 reads were selected to determine the consensus of the multiple repeats in the concatemer generated by RCA from the single molecules.
TABLE 3 Details of Nanopore sequencing results from clinical samples
Abbreviations: end, end of follow–up; failure, treatment failure; start, start of treatment.
Signature RAS detected by populational and Nanopore sequencingA summary of the signature RAS detected by Nanopore and direct (populational) sequencing at NS3-D168, NS5A-P29, NS5A-P32, NS5A-L31, and NS5A-Y93 in 12 patients is shown in Table 4.[12] All patients had RASs in these positions during the course with changes over time. Nanopore sequencing detected most of the signature mutations in a quantitative manner. Although sampling time points of Nanopore sequencing were not necessarily the same for all cases with direct sequencing (Table 1), information of populational sequencing results from the same treatment phases (start of treatment, treatment failure, and end of follow-up) were useful for evaluating the validity of Nanopore sequencing. Analyses of 100 reads at each time point could find the minor RAS that was not detected by direct sequencing, finding most patients had a minor viral population with various types of substitutions at these signature RAS positions. The results of target deep sequencing of case 2 and case 10 showed a similar proportion of major substitution with Nanopore sequencing (Table 5). In case 5, NS5A-P29 deletion mutant, which is highly resistant to pibrentasvir, (
TABLE 4 Signature RAS found in NS3 and NS5A during ASV/DCV therapy
Abbreviations: A, Alanine; C, Cysteine; D, Aspartate; E, Glutamate; F, Phenylalanine; G, Glycine; H, Histidine; I, Isoleucine; K, Lysine; L, Leucine; M, Methionine; N, Asparagine; P, Proline; Q, Glutamine; R, Arginine; S, Serine; T, Threonine; V, Valine; W, Tryptophan; Y, Tyrosine.
aNumbers in parentheses in the Nanopore column are the percentage of reads with corresponding amino acids by Nanopore sequencing.
bTwo amino acids divided by the slash in the Direct column mean the mixture of amino acids; the first letter indicates the higher peak in the electropherogram.
cTime period for Nanopore sequencing.
TABLE 5 Comparison of Nanopore sequencing and target deep sequencing for signature RAS
Case No. | NS5A-L31 | NS5A-Y93 | ||||||
Start of Treatment | Treatment Failure | Start of Treatment | Treatment Failure | |||||
Nanopore | Target Deep | Nanopore | Target Deep | Nanopore | Target Deep | Nanopore | Target Deep | |
2 | I(98) L(2) | I(99.91) | I(96) V(2) M(1) L(1) | I(99.57) M(0.26) V(0.13) | Y(99) H(1) | Y(99.97) | H(95) Y(5) | H(99.57) R(0.32) Y(0.10) |
10 | L(95) M(5) | L(99.92) | M(91) F(5) I(2) L(1) V(1) | M(95.10) I(2.60) V(1.96) L(0.19) | Y(86) H(14) | Y(84.13) H(15.41) C(0.31) | H(98) Y(2) | H(99.58) R(0.28) Y (0.11) |
Numbers in parentheses are the percentage of reads with corresponding amino acids.
Abbreviations: C, Cysteine; F, Phenylalanine; H, Histidine; I, Isoleucine; L, Leucine; M, Methionine; R, Arginine; V, Valine; Y, Tyrosine.
Single-molecule, long-read sequencing of HCV haplotype during DAA treatmentThe alignment (left panel) and phylogenetic (right panel) analysis of 100 long reads at three time points (start of treatment, treatment failure, end of follow-up) in 12 patients is shown in Figure 2. Amino acid substitutions from the consensus for each patient are depicted in alignment panels. Phylogenetic trees with a scale bar for genetic distance for each patient showed that several haplotypes can be detected in each patient. In the alignments of individual HCV genomes from single patients, numerous substitutions composing quasispecies found over the entire length of the PCR-amplified NS3 to NS5A region revealed the large-scale structures of quasispecies that have not been elucidated by short-read deep sequencing to date. Multiple nontypical RASs could be detected in a patient-specific manner. There were multiple HASs other than signature RASs in each patient. Haplotypes were not always consistent with the three time points, indicating that each haplotype reacted differently to DAA. In each haplotype, there were haplotype-specific RASs that changed over time during treatment in a haplotype-specific manner. Genetic distance of each phylogenetic tree can be estimated with scale bars that differ among cases; for example, cases 1 and 7 showed much lower genetic distances compared to other cases.
To elucidate the relations between HAS and RAS, serial changes of HAS during the DAA treatment were analyzed. The dynamic changes of HAS in each patient during the course with different amino acid substitutions among time points or haplotypes for each case are shown in Figure 3. Colored barcode patterns depicting the different amino acid residues and heights reflect the proportions of different residues at each position. These RASs and HASs extracted statistically by meta-CATS analysis with significant p values (5 x 10–5) demonstrated right-most columns for each HAS or RAS. The composition of haplotypes changed dynamically during the course in each patient, as shown in the blue, red, and green bars. In addition, haplotype type-specific RASs, which are only found in specific haplotypes, were found in 10 of 12 patients. This analysis clearly showed the dynamic changes of haplotype composition that take place during DAA failure, which have been considered to be caused by a simple RAS-driven decrease of DAA susceptibility of the HCV mutant. Instead, DAA failure is related to the selections of HCV haplotypes with changes of haplotype-specific RAS. The normalized Shannon entropies of the 12 patients for each time point were (mean [SD]) 0.470 [0.213], 0.549 [0.260], and 0.701 [0.195] for start of treatment, treatment failure, and end of follow-up, respectively. These were significantly different among time points (p = 0.0338 with one-way repeated-measures analysis of variance; p = 0.0335 for start of treatment versus end of follow-up with Bonferroni correction).
In this study, we analyzed the HCV genomes related to DAA treatment failure by using Nanopore, a third-generation sequencer that enables long-read single-molecule sequencing of a target with more than 10 kb.[17] As a result, we clarified for the first time that DAA-resistant HCV consisted of resistant virus haplotypes, quasispecies with various HASs, and changed over time during the treatment with different RASs. This indicates that resistance to DAA,[6] growth fitness in the host,[26] and the ability to escape from the immune reaction are different depending on the haplotype[27] and not on the simple individual signature mutations of HCV genomes as previously thought. The HCV haplotype may be involved in a pathological process, such as fibrosis or carcinogenesis, caused by persistent infection of HCV. It is expected that the newly developed haplotyping by Nanopore sequencing will contribute to the elucidation of pathogenesis of viral diseases not limited to HCV.[16, 21, 28]
HCV is a single-stranded RNA virus; its RNA polymerase does not have proof-reading activity and forms a quasispecies population with various mutations in the host.[10] Traditional Sanger direct sequencing of PCR products cannot reveal this quasispecies,[29] and sequencing of cloned PCR products can analyze only a small number of quasispecies clones spanning only a partial part of HCV genomes up to several hundred base pairs.[30] In recent years, massive parallel sequencing (100–300 bp), so-called short-read deep sequencing, has become possible by next-generation sequencing, and analysis of HCV quasispecies involved in DAA resistance has been performed for a small part of the HCV genome around signature RASs, showing that individual resistance mutations are associated with treatment failure[22,25,31] and the presence of mutations associated with liver carcinogenesis.[8] However, it has been difficult to analyze the entire structure of HCV with the combination of these individual characteristic mutations (haplotype) from a single viral genome in clinical samples with mixed quasispecies.[32]
The development of a sequencer that enables the long read of a single molecule, such as Nanopore[17] and PacBio,[33] is expected to facilitate analyzing how the haplotype of the above-mentioned viral genome affects the clinical picture. However, these third-generation sequencers have a high sequence error rate of around 10% as a trade-off of long reads.[34] In recent years, a technique has been developed that makes a target molecule circular to amplify and to read a single sequence multiple times as a concatemer to reveal the target sequence with error correction. PacBio with single-molecule real-time (SMRT) technology[35] and Nanopore with RCA technology[36] were developed to amplify and to sequence a concatemer of a target sequence of several tens of kilobases from a circular molecule. Applications of HCV sequencing of PacBio SMRT[37] and hepatitis B virus (HBV) sequencing of Nanopore RCA[38] have been reported.
In this study, Nanopore RCA was applied for the first time to the analysis of the DAA-resistant haplotype of HCV. When a PCR product derived from an HCV plasmid consisting of a single sequence was analyzed by this method, the accuracy was over 99.9% for a long read of an approximate 3000-bp single molecule, and it was shown to have sufficient specificity. On the other hand, when PCR products derived from HCV plasmids with different sequences were spiked, this analysis showed that mutations of at least 2% proportion could be detected. We also correlated the results of Nanopore and target deep sequencing of signature RASs in selected clinical cases and confirmed that proportions of major mutations were similar at each site (Table 5, case 2 and case 10), but those of minor mutations were not exactly the same. This might be caused by PCR selection bias for minor variants because the primer sets used for deep and Nanopore sequencing were different. In addition, if the number of Nanopore reads is further increased, it may be possible to detect variants with a lower frequency in the future.
In the present study, 100 reads were finally analyzed from the PCR-amplified HCV target sequence, and possible selection biases should be considered for evaluation of diversity. The first is PCR duplication bias due to the small number of templates. The number of HCV genomes subjected to PCR was equivalent to those in approximately 5.83 μL of serum, which corresponds to 1/170 of serum HCV titer (IU/mL). The HCV titer was 4.7 to 7.1 (median, 6.15) logIU/mL, which means approximately 50,000 to 12,500,000 (median, 1,400,000) copies of HCV genome per milliliter (Table 1). Therefore, 300 to 70,000 (median, 8000) copies of the HCV genome were subjected to PCR and amplified for downstream analysis. Therefore, analysis of 100 reads can reflect the diversity of the original sample without severe, if any, bias. However, if a larger number of reads are analyzed, it is important to secure enough HCV genome for PCR.
The second is PCR selection bias originating from unequal primer binding or amplification efficiency due to sequence variations among templates. FW3, FW4, RV1, and RV2 primers were effective for 76% (109/144) of PCR reactions, suggesting that binding sites of these primers are well preserved (Table 1). However, other samples still needed different primers, suggesting that PCR selection bias cannot be eliminated whenever PCR is used. Actually, in treatment failure of case 4, mixed RASs detected by direct sequence (D168E, L31V) were not apparent in Nanopore sequencing from the same sample (Table 4), possibly because primers used for direct and Nanopore sequencing were not identical. PCR-free, single-molecule, long-read Nanopore sequencing may be one of the solutions, and it has been successfully applied for HBV, which has a double-stranded circular DNA genome ready for RCA.[38] This technology is still challenging for HCV, which has a single-stranded linear RNA genome requiring additional RT and circularization steps, which result in much lower efficiencies. Further technological breakthrough is needed for complete removal of these PCR biases. We selected the longest 100 reads for phylogenetic analysis after RCA with the TruePrime Kit, which enables uniform amplification of mixed templates because it does not use random primers.[39] Therefore, the final 100 reads were randomly selected from PCR-amplified molecules.
Currently, the standard treatment for HCV is the combination of three classes of DAA, NS3 protease inhibitor, NS5A inhibitor, and NS5B polymerase inhibitor.[3] The NS3 protease inhibitor was the first class of DAAs developed, and its use started in 2010 in combination with interferon. NS5A inhibitor is the second class of DAAs and is a key drug contained in most regimens. Various amino acid substitutions, such as D168 mutation for NS3 resistance and L31/Y93 resistance for NS5A resistance, were shown in vitro to alter DAA susceptibility of HCV and may be related to a therapeutic effect in vivo.[40] The emergence of HCV-resistant virus against multiple drugs became a serious problem after treatment failure of combination DAA treatments with multiple RASs related to different classes of DAAs. Proper and careful drug selection with genetic analyses is recommended for treatment of such multidrug-resistant HCV.[41]
However, in conventional techniques, it is difficult to confirm that these mutations for different drug resistance exist on a single genome.[11,12,40] This study showed that such multi-resistant haplotypes appeared after treatment failure of DAA treatment with different RASs in a single viral genome. From the phylogenetic tree analysis, we successfully visualized resistant haplotypes contributing to treatment failure because some quasispecies before treatment with multiple mutations were selected at treatment failure. Furthermore, we also clarified that resistant haplotypes are replaced with haplotypes having lower resistance and possibly higher proliferative fitness when the selective pressure disappears after drug administration. We did not exclude signature RAS for phylogenetic analysis to detect haplotype-specific RAS. With a combination of RASs in a single viral genome, the existence of the DAA-resistant haplotype was revealed for the first time by using Nanopore sequencing. Similar results have been recently shown with PacBio SMRT, and further accumulation of knowledge for the multidrug-resistant HCV haplotype is expected.[21,28]
In addition, the present study successfully estimated viral diversities with genetic distance of the phylogenetic tree and found their differences among patients. Some cases (case 1 and case 7) had lower diversity compared to the others, and its clinical significance should be further studied. Shannon entropies increased over time, suggesting divergence of haplotype complexity is related to treatment failure to DAA with the emergence of different haplotypes with different RASs. Haplotypes that became dominant later were not always detected at earlier time points; for example, in case 3, 4, and 8, major haplotypes at treatment failure were not detected at the start of treatment (Figure 3). These newly appearing haplotypes were not necessarily de novo but might be due to the small sampling size of 100 reads that can hardly detect haplotypes with proportions lower than 1%. Analysis with much larger number of reads will be needed to better track the minor haplotypes.
In this study, we performed a single-molecule sequence of the 3.2-kb region, which is about one third of the 9.6-kb genome of HCV. In the future, it will be necessary to elucidate the relationship between the haplotype of full-length HCV genome mutations and pathophysiology.[20] Because the bioinformatics method we used was designed to detect only small deletions up to 1% of the target sequence, we could not evaluate the large structural variants (deletion, insertion, and recombination) of the HCV genome at a single-molecule level. Structural variation of the HCV genome has been reported, such as large deletions related to more advanced liver disease, suggesting greater pathogenicity of the defective virus[42–46] or recombination,[28,47] but the details are still unclear. Recently, we applied the present method modified for the whole HCV genome mutation and deletion analysis and detected large deletions of the structural protein gene previously reported at the single-molecule levels (data not shown). It is expected that single-molecule long-read sequencing will facilitate understanding the entire HCV genome structure and its pathogenicity.
The Nanopore sequencer has characteristics that are different from conventional sequencers, such as small size, low cost of a USB connection, and real-time sequence information with few pretreatments, and it can be used not only in the laboratory but also in field research and clinical practice.[34,48] The problem of a high error rate has been a common property of the long-read sequencer, but this problem may be overcome by introducing a technique to sequence the target repeatedly as a concatemer, such as RCA applied in this study. In 2021, Oxford Nanopore Technologies announced the development of Q20+ Chemistry, which enables single-molecule long-read sequencing with an accuracy of 99% or higher (
In the present study, we successfully clarified the kinetics of DAA-resistant haplotypes of HCV by combining Nanopore and RCA for single-molecule long-read sequencing. It is expected that this method can be applied not only to elucidate the relation between HCV genome structure and pathologies but also to analyze various viral diseases.
ACKNOWLEDGEMENTWe thank Dr. Koji Moriishi and Dr. Atsuya Yamashita for their scientific advice and Ms. Takako Ohmori and Ms. Tomoko Nakajima for their technical assistance.
CONFLICT OF INTERESTDr. Enomoto consults for, is on the speakers’ bureau for, and received grants from Gilead, AbbVie, Merck Sharp and Dohme, and Bristol-Myers Squibb. The other authors have nothing to report.
AUTHOR CONTRIBUTIONSKozue Yamauchi performed most of the experiments. Mitsuaki Sato performed the populational and deep sequencing. Leona Osawa, Shuya Matsuda, Yasuyuki Komiyama, Natsuko Nakakuki, Hitomi Takada, Ryo Katoh, Masaru Muraoka, Yuichiro Suzuki, Akihisa Tatsumi, Mika Miura, Shinichi Takano, Fumitake Amemiya, and Mitsuharu Fukasawa helped in clinical analyses. Yasuhiro Nakayama, Tatsuya Yamaguchi, and Taisuke Inoue assisted discussions. Shinya Maekawa and Nobuyuki Enomoto designed the study.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The method of analyzing individual resistant hepatitis C virus (HCV) by a combination of haplotyping and resistance‐associated substitution (RAS) has not been fully elucidated because conventional sequencing has only yielded short and fragmented viral genomes. We performed haplotype analysis of HCV mutations in 12 asunaprevir/daclatasvir treatment‐failure cases using the Oxford Nanopore sequencer. This enabled single‐molecule long‐read sequencing using rolling circle amplification (RCA) for correction of the sequencing error. RCA of the circularized reverse‐transcription polymerase chain reaction products successfully produced DNA longer than 30 kilobase pairs (kb) containing multiple tandem repeats of a target 3 kb HCV genome. The long‐read sequencing of these RCA products could determine the original sequence of the target single molecule as the consensus nucleotide sequence of the tandem repeats and revealed the presence of multiple viral haplotypes with the combination of various mutations in each host. In addition to already known signature RASs, such as NS3‐D168 and NS5A‐L31/Y93, there were various RASs specific to a different haplotype after treatment failure. The distribution of viral haplotype changed over time; some haplotypes disappeared without acquiring resistant mutations, and other haplotypes, which were not observed before treatment, appeared after treatment. Conclusion: The combination of various mutations other than the known signature RAS was suggested to influence the kinetics of individual HCV quasispecies in the direct‐acting antiviral treatment. HCV haplotype dynamic analysis will provide novel information on the role of HCV diversity within the host, which will be useful for elucidating the pathological mechanism of HCV‐related diseases.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
1 Department of Gastroenterology and Hepatology, Faculty of Medicine, University of Yamanashi, Yamanashi, Japan