ArticlePDF Available

Fast Screening and Primary Diagnosis of COVID-19 by ATR FT-IR

Authors:

Figures

Content may be subject to copyright.
Fast Screening and Primary Diagnosis of COVID-19 by ATRFT-IR
Liyang Zhang,
Meng Xiao,
Yao Wang,
Siqi Peng, Yu Chen, Dong Zhang, Dongheyu Zhang,
Yuntao Guo, Xinxin Wang, Haiyun Luo,*Qun Zhou,*and Yingchun Xu*
Cite This: Anal. Chem. 2021, 93, 21912199
Read Online
ACCESS Metrics & More Article Recommendations *
sıSupporting Information
ABSTRACT: The outbreak of coronavirus disease 2019 (COVID-19) has led to
substantial infections and mortality around the world. Fast screening and diagnosis
are thus crucial for quick isolation and clinical intervention. In this work, we
showed that attenuated total reectionFourier transform infrared spectroscopy
(ATRFT-IR) can be a primary diagnostic tool for COVID-19 as a supplement to
in-use techniques. It requires only a small volume (3μL) of the serum sample
and a shorter detection time (several minutes). The distinct spectral dierences
and the separability between normal control and COVID-19 were investigated
using multivariate and statistical analysis. Results showed that ATRFT-IR
coupled with partial least squares discriminant analysis was eective to dierentiate
COVID-19 from normal controls and some common respiratory viral infections or
inammation, with the area under the receiver operating characteristic curve
(AUROC) of 0.9561 (95% CI: 0.90710.9774). Several serum constituents
including, but not just, antibodies and serum phospholipids could be reected on the infrared spectra, serving as chemical
ngerprintsand accounting for good model performances.
INTRODUCTION
Coronavirus disease 2019 (COVID-19) is a pandemic caused
by severe acute respiratory syndrome coronavirus 2 (SARS-
CoV-2), a newly appearing coronavirus which has spread over
the world and led to substantial infections and mortality.
1
Reverse transcription polymerase chain reaction (RT-PCR)
is a conventional and standard assay for viral diagnosis and has
been widely used for SARS-CoV-2 RNA detection. SARS-
CoV-2 RNA can be detected in both upper and lower
respiratory specimens including nasal swab, oropharyngeal
swab, sputum, and bronchoalveolar lavage uid (BALF).
2,3
Despite BALF, which is not a requisite for COVID-19
diagnosis because of the harder sampling, the sputum was
reported to have the highest positive rate (74.488.9%),
followed by nasal swabs (53.673.3%) during the rst 14 days
after onset (d.a.o.).
3
The positive rate for throat swab was
reported to be around 60%.
3,4
Viral RNA is also detected in
serum samples with a percentage of 0% (0/31),
5
8% (1/12),
6
or 15% (6/41).
7
Notably, several factors may inuence the
performance of RT-PCR such as improper sample preparation
or varied qualities of detection kits and thus lead to high false-
positive rates. In addition, viral replication is inhibited in the
late stage of infection, accounting for the high false-negative
rates in this stage. Also, it is time-consuming to perform the
whole test procedures.
Serological assay based on immunoglobulin-G (IgG) and
IgM levels can serve as a complement to nucleic acid
detection.
8,9
The median time of IgM and IgG seroconversion
was reported to be 5 (n= 41) and 14 (n= 208) days after
onset, respectively.
10
The combination of IgM and IgG tests
yielded a higher detection sensitivity of 88.66% and specicity
of 90.63% (397 PCR conrmed patients and 128 negative
patients in total) than a single IgG or IgM test.
11
Additionally,
a higher positive detection rate of 99.4% (n= 173, 95% CI
96.8100%) was achieved when applying both antibody and
nucleic acid tests, compared to a single RNA test of 67.1%
(95% CI 59.474.1%).
12
Nevertheless, there remain some
problems unclear such as the antibody responses of COVID-19
patients, the potential false positive caused by immunological
cross reactivity, and the varied performances of commercially
available detection kits.
Rapid and reliable diagnosis of COVID-19 is of great
signicance to help screen the COVID-19 patients and deliver
more appropriate treatment. In the last decade, transmission or
attenuated total reection (ATR) Fourier transform infrared
spectroscopy (FT-IR) and Raman spectroscopy have been
utilized to identify viral infections or predict viral load in
blood,
13
sera,
1315
plasma,
16
or infected cells,
17,18
dierentiate
dierent viral infections,
19
and verify the infectious agent type
(as bacterial or viral) based on the white blood cell data.
20
Subtle molecular and chemical changes in blood components
Received: September 25, 2020
Accepted: December 18, 2020
Published: January 11, 2021
Articlepubs.acs.org/ac
© 2021 American Chemical Society 2191
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
This article is made available via the ACS COVID-19 subset for unrestricted RESEARCH re-use
and analyses in any form or by any means with acknowledgement of the original source.
These permissions are granted for the duration of the World Health Organization (WHO)
declaration of COVID-19 as a global pandemic.
Downloaded via 223.72.51.182 on March 14, 2021 at 05:57:33 (UTC).
See https://pubs.acs.org/sharingguidelines for options on how to legitimately share published articles.
in response to bacterial or viral infections can be recorded and
reected by the infrared spectra. For example, the strong band
at 1631 cm1attributed to the β-pleated sheet protein marker
of Ig is unique to the positive serum spectra induced by
hepatitis B and C virus.
14
In comparison with other assays,
infrared spectroscopy enables us to inspect almost all biological
components at once, which may be benecial to COVID-19
diagnosis. Additionally, it is easier to perform and takes less
operation time (typically for several minutes).
In this work, we showed the feasibility of ATRFT-IR in
COVID-19 screening and primary diagnosis. The spectral
dierences between COVID-19 and healthy controls and the
potential spectral markers were identied by multivariate and
statistical analysis. For the purpose of the performance test,
especially the specicity of the proposed model, healthy
controls and some common respiratory viral infections or
inammation were considered.
MATERIALS AND METHODS
Participants. We collected a total of 115 blood samples
from 20 healthy donors and 76 patients, of which 41 were
conrmed with COVID-19, 15 had respiratory viral infections
caused by inuenza A/B or respiratory syncytial virus (RSV),
and 20 were with inammation-related diseases (Table 1).
Inuenza A/B and RSV are chosen because they are common
in respiratory infections and share similar u-like symp-
toms
21,22
to COVID-19. Based on a report regarding 1099
patients, the most common symptoms of COVID-19 are fever
(43.8% on admission and 88.7% during hospitalization) and
cough (67.8%).
23
We know that inammation-related diseases
can also contribute to alterations in serum components (see
Discussion). Here, respiratory bacterial infections, pulmonary
infection, intra-abdominal infection, bacteremia, and some
other diseases were enrolled in the study (Table S1). We aim
to investigate whether and how the serum infrared spectra are
specic for COVID-19.
The COVID-19 patients were from two cohorts, all of whom
were diagnosed by reverse transcriptase PCR (RT-PCR)
following China national guidelines for diagnosis and treatment of
Corona Virus Disease 2019 (COVID-19) (trial version 5,
revised).
24
The rst cohort included 35 critically ill COVID-19
patients who were admitted from Feb 15 to March 30 at the
Sino-French New City Branch of Tongji Hospital in Wuhan.
Of the 35 patients, one was with <7 days postsymptom onset
and one was within 714 days, while 33 were with >14 days.
In the second cohort collected in the Peking Union Medical
College Hospital, ve conrmed cases were with >14 days after
onset, while one was with 2 days. All of them were with mild
symptoms. Clinical residual blood samples (stored at 80 °C)
from inuenza A, inuenza B, or respiratory syncytial virus
(RSV)-infected patients admitted from Feb 16 to March 1,
2020, were revived. The viral nucleic acids were determined by
XPert Xpress Flu/RSV (Cepheid AB, Sweden). Of note, for
each patient with inuenza A or RSV infections, two to three
blood specimens from dierent infection stages before
recovery were collected to enrich and generalize the data set.
Patients with other diseases were diagnosed using standard
clinical methods and proved to be without any SARS-CoV-2
infection (Table S1). This study was approved by the Ethics
Committees from Tsinghua University and Peking Union
Medical College Hospital.
Sample Preparation and ATRFTIR Spectroscopy.
The serum samples were obtained by blood centrifugation
under 5000 rpm for 5 min. Prior to measurements, the serum
specimens were incubated at 56 °C for 30 min to inactivate the
potential pathogens. Each specimen was measured one to three
times on a PerkinElmer infrared spectrometer coupled with a
diamond ATR accessory at a resolution of 4 cm1. Sixteen
scans were accumulated per spectrum. For each measurement,
an aliquot of 3 or 4 μL serum sample was transferred onto the
ATR crystal and allowed to dry under mild airow at room
temperature. Water absorption was monitored by OH
stretching at around 3300 cm1and bending at around 1635
cm1. It took about 35 min for samples to be suciently
dried. Afterward, the spectra between 4000 and 600 cm1were
collected. The spectral background was recorded separately for
each sample to achieve a higher signal-to-noise ratio. Prior to
data analysis, the raw spectral data were preprocessed with
baseline calibration using the built-in rubber-band algorithm
on the PerkinElmer spectrum (version 10.5.4). The corre-
sponding constant baseline drift at 4000 cm1was subtracted
for each sample spectrum, and then, all the spectra were
normalized to amide I by peak absorbance. The second-
derivative infrared (SD-IR) spectra were calculated from the
normalized spectra.
Multivariate and Statistical Analysis. All data analyses
were performed on MATLAB (version R2020b, Mathworks,
Natick, U.S.A). The second-derivative infrared (SD-IR)
spectra were calculated by the 13-point SavitzkyGolay
algorithm using in-house protocols.
Analysis of variance (ANOVA) is a well-established method
to evaluate the dierence of the observed statistic(s) among
groups by calculating the F-statistic (i.e., the ratio of between-
group variance to within-group variance).
25
The larger F-
statistic yields a lower pvalue, indicating that the groups are
more likely to be dierent in this statistic. In this work, the p
value was used to evaluate the statistical spectral dierences of
major bands in absorbances or band locations of healthy
controls and COVID-19 patients. Hence, the potential spectral
markers can be primarily identied.
Hierarchical cluster analysis (HCA) and principal compo-
nent analysis (PCA) are widely used unsupervised approaches
in microorganism dierentiation.
2628
In HCA, the spectral
distance (usually Euclidian distance) is calculated to construct
the hierarchical structure, in which root nodes are formed by
spectra with the shortest distances, while leaf nodes are formed
by the longest distances. PCA aims to reduce the dimensions
of the original matrix data in terms of maximum variance.
26,29
In the new vector space formed by principal components
(PC), data distribution is more apparent. Here, the natural
Table 1. Number of Participants, Specimens, and the
Measured Spectra in This Study
participants
(n= 96) specimens
(n= 115) spectra
(n= 289)
COVID-19
(cohort 1) 35 35 59
COVID-19
(cohort 2) 6618
Inuenza A 4 8 23
Inuenza B 2 2 5
RSV 9 24 73
inammation
a
20 20 60
control 20 20 51
a
Details are presented in Table S1.
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2192
separability of normal controls and nonsevere and severe
COVID-19 patients was tested by the two methods.
Partial least squares discriminant analysis (PLS-DA) is a
supervised chemometric technique whereby the so-called
latent variables are successively extracted to nd the maximum
correlation between the X-matrix and Y-matrix.
14,30
In the
present study, the data set was divided into three groups:
normal controls, COVID-19, and infections caused by other
diseases. PLS1 and PLS2 models
31
were applied for two-group
and three-group classication, respectively. Specically, in the
case of PLS1, the two classes were coded as [0] and [1],
whereas the three classes were coded as [1 0 0], [0 1 0], and [1
0 0] in the PLS2 model. In comparison with PCA, PLS-DA is
more suitable to identify the signicant wavenumbers for
discrimination by inspecting the regression vector
14
or the
variable importance in projection (VIP).
32
RESULTS
COVID-19 Vs Controls: Band Assignments, Spectral
Dierences, and Spectral Interpretation. It is known that
serum is composed of proteins, cholesterol, glucose, urea,
triglycerides, and other more dilute compounds, all of which
can be recorded in the spectra, whereas only components with
higher abundance could be identied in the spectra and
provide insightful information. Reports show that a wide range
of abundant biomolecules in plasma can be quantied using
FTIR.
33
To better illustrate the spectral features, we
summarized the normal levels of the major serum components
(Table S2).
Averaged spectra minimize the inuence of individual
dierences and thus are more representative. As for the
original spectra, all bands had signicant frequency shifts in the
COVID-19 group (p< 0.001, Figure 1a and Table 2),
Figure 1. Spectral proles of COVID-19 and control serum samples and some abundant constituents in human serum. (a) Averaged original
spectra of COVID-19 and control samples. Spectra were normalized to amide I. The principal absorbance bands were annotated. (b) Averaged SD-
IR spectra of COVID-19 and control samples and the most abundant constituents in human serum. Bands with notable variations were annotated
(see also Table S3). Lysolecithin, sphingomyelin, lecithin, and human IgG (Bioss, Beijing, China, puried by Protein-A anity chromatography)
were in dried powder form, whereas human serum albumin was buered with water. The band located at 981 cm1in the human IgG spectrum
may arise from impurities.
Table 2. Normalized Spectra: Band Assignments and Statistical Comparisons between COVID-19 and Control Spectra in Band
Locations and Relative Absorbance
a
,
d
,
e
,
f
band locations/(cm1) relative absorbance (a.u.)
assignments
19,27
control COVID-19 pcontrol COVID-19 p changes (%)
b
amide A 3285.65 (1.04) 3282.75 (1.19) *** 0.430 (0.028) 0.500 (0.049) *** +16.3
νas(CH3) 2958.31 (0.55) 2958.99 (0.83) *** 0.215 (0.007) 0.217 (0.008)
νas(CH2) 2930.76 (1.05) 2929.71 (1.55) *** 0.220 (0.010) 0.225 (0.015) *+2.3
νs(CH3) 2872.61 (0.49) 2873.14 (0.76) *** 0.150 (0.006) 0.151 (0.008)
amide I 1642.27 (2.28) 1636.84 (2.24) *** 1 (0)
c
1 (0)
c
amide II 1537.18 (1.05) 1538.19 (0.95) *** 0.878 (0.03) 0.857 (0.025) *** 2.4
δ(CH2) 1453.33 (0.62) 1452.9 (0.7) *** 0.342 (0.021) 0.339 (0.014)
νs(COO) 1397.53 (1.05) 1398.22 (0.68) *** 0.386 (0.024) 0.380 (0.016)
amide III 1308.06 (1.7) 1311.19 (1.14) *** 0.280 (0.021) 0.279 (0.022)
νas(PO2) 1242.29 (0.81) 1241.62 (1.01) *** 0.264 (0.023) 0.271 (0.016)
νs(COC) 1170.12 (0.33) 1167.6 (2.85) *** 0.154 (0.016) 0.151 (0.010)
νs(PO2) 1079.63 (0.77) 1077.36 (1.02) *** 0.162 (0.017) 0.193 (0.025) *** +19.1
a
νs: symmetric stretching vibrations; νas: asymmetric stretching vibrations; δ: bending vibrations. Data are in mean (standard deviation) or %.
b
Increases in average band absorbance compared to controls.
c
All the spectra were normalized to amide I.
d
Statistical dierences were compared
between COVID-19 and the control group using one-way ANOVA.
e
*p< 0.05. **p< 0.01. ***p< 0.001.
f
See more in Figure S2.
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2193
indicating the notable changes in protein and lipid
conformation. Bands related to proteins include amide A
(NH stretching vibration), amide I (CO stretching),
amide II (CN stretch and NH bending), and amide III. As
seen from Table 2, the relative absorbance values of both
amide A and amide II showed signicant dierences between
COVID-19 and the control group (p< 0.001), denoting the
alterations in protein concentrations.
We found that the amide I band of the COVID-19 group
had a signicant red shift of roughly 6 cm1compared to the
control (p< 0.001, Figure 1a), suggesting the transitions in
protein secondary structures. This is further illustrated by the
SD-IR spectra (Figures 1b, S3, and Table S3). The band
centered at 1651 cm1from the α-helix structure was
signicantly lower (p< 0.001), while the band centered at
1632 cm1attributed to the β-pleated sheet structure
26
was
signicantly higher (p< 0.001) in the COVID-19 group,
responsible for the notable frequency shift of amide I in the
original spectra. Human serum albumin (HSA) (about 70% of
serum by weight), immunoglobulin (IgG: 14%), transferrin
(5.7%), and α-antitrypsin (0.7%) are among the most
abundant serum proteins (Table S2).
35
HSA is predominated
by α-helix and Ig by the β-sheet (also see Figure 1b).
3638
Hence, bands centered at 1651 and 1632 cm1principally
provide hints toward HSA and Ig in human sera, respectively
(see more in the Discussion section).
The peak at 1397 cm1is associated with COOsymmetric
stretching mainly from aspartic acids and glutamic acids in
protein side chains.
34
No signicant change was observed in
the absorbance of this band in the COVID-19 group, despite
the slight band shift. Bands centered at 1242 and 1078 cm1
correspond to asymmetric and symmetric stretches from PO2
groups, respectively.
34
Compared to controls, there was no
signicant dierence in the relative absorbance of the former
band in COVID-19 sample spectra (p> 0.05), whereas a
signicant increase (by 19.1%) was found in the latter band (p
< 0.001, Figure 1a and Table 2). The PO2functional group
may arise from nucleic acids or phospholipids in serum.
However, nucleic acids such as cell-free circulating DNA
(cirDNA)
21
or SARS-CoV-2 RNA
5
exhibit limited abundance
in human serum. Hence, increases in the band centered at
1078 cm1can be attributed to phospholipids. It is known that
lecithin, sphingomyelin, and lysolecithin are the three major
phospholipids in normal human serum, accounting for 95% on
a weight percent basis (see also Table S1).
39,40
In the mild,
severe, and fatal COVID-19 patients, sphingolipids and
lysolecithin (12:0/0:0) were observed with a 3.55.5-fold
increase (log2fold change: 1.82.5), whereas lecithin
decreases by 50%80% (2.5 < log2fold change < 1.0).
41
Considering sphingolipids to be the most abundant phospho-
lipids in serum, the stronger absorption in the band centered at
1078 cm1may reect the higher total phospholipid content in
COVID-19 patients. This nding was further validated by the
SD-IR spectra shown in Figure 1b, where the νs(PO2)
absorption band was found at 1086 and 1083 cm1in the SD-
IR spectra of sphingomyelin and lysolecithin, respectively. This
slight band shift from 1079 cm1may result from sample states
as the purchased samples were in dried powder form.
The bands centered at 1170 cm1in the control group and
1166 cm1in COVID-19 sample spectra originate from ester
COC asymmetric stretching of phospholipids, triglycerides,
and cholesterol esters.
42
No signicant absorbance alterations
Figure 2. Discrimination among normal controls and nonsevere and severe COVID-19 patients with unsupervised methods. (a,b) PCA score plot
using spectral ranges of 16001700 cm1and 10001700 cm1, respectively. Spectra from three patients are marked with arrows. NS = nonsevere.
S = severe. (c) Hierarchical cluster analysis (10001700 cm1). The windows corresponding to the three groups are lled with dierent colors. 1 =
normal controls; 2 = nonsevere COVID-19 patients; 3 = severe COVID-19 patients.
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2194
were observed. Nevertheless, we found that this band together
with the bands centered at 2925 and 2853 cm1in the SD-IR
spectra had a signicant blue shift in the absorption frequency
(Figure 1b and Table S3), suggesting the potential conforma-
tional changes in lipid components.
Dierentiate COVID-19 Patients from Normal Con-
trols. PCA was rst performed on the SD-IR spectra to reduce
the dimension of infrared spectral space. It requires the rst
ve and ten components to account for 90% of the total
variance in the data ranging from 1600 to 1700 cm1and 1000
to 1700 cm1, respectively. Nevertheless, the score plot with
respect to the rst two PCs provides meaningful information
(Figure 2). Obvious transitions from normal controls to
nonsevere COVID-19 patients and to the severe ones were
observed (Figure 2a). Overlapping was mitigated when
applying the spectral range of 10001700 cm1(Figure 2b).
This implies that other serum components in addition to
proteins may play a role in dierentiating between healthy
controls and COVID-19 patients and evaluating the illness
severities. Two patients (NS348 and NS457) in the nonsevere
group and one patient (S55) in the severe group were found to
be closer to normal controls. It is noteworthy that patient
NS348 was in the early stage of illness with 2 days after onset,
while NS457 was an asymptomatic patient who was validated
with at least >14 d.a.o. S55 was a critically ill patient but with
time after onset of <7 days when sampled. Note that two
components explain about 60% and 55% of the total variance
when applying a spectral window of 16001700 or 1000
1700 cm1, respectively. For comparison with the 2-PC results,
the score plot regarding three PCs is shown in Figure S4.
We further applied HCA to classify the groups using the
shortest distance method and Euclidian distance based on the
spectral region of 10001700 cm1. Most of the severe
patients were clustered together except six spectra from
patients S57, S14, S24, and S55 (Figure 2c). Among the
spectra from nonsevere patients, six of them from patients
NS348 and NS457 were mixed with normal controls, whereas
the remaining was separated well with others (Figure 2c).
Results of the two unsupervised methods show the potential
feasibility of FT-IR to discern COVID-19 patients in dierent
severities, yet supervised learning methods with the labeled
data set may provide more insightful information.
PLS-DA is commonly used in chemometrics for its ability to
handle multivariate data and high model interpretability.
31
We
rst performed a PLS-DA analysis using SD-IR spectra in the
region between 1000 and 1700 cm1. The rst latent variable
(LV, i.e., score) was extracted under leave-one-out cross
validation with an R2Yvalue of 0.793 and a Q2value of 0.790.
Regularly, Q2> 0.5 is admitted for good model predictability.
43
The regression vector of variables (loadings for LV1) and the
corresponding variable importance in the projection (VIP)
values are presented in Figure 3a. VIP is a parameter to
evaluate the inuence of individual X-variables on the model
and is preferred if the value is larger than one.
32,44
Clearly, the
most discriminatory bands are located at 1655, 1625, 1557,
1506, 1074, and 1035 cm1. This nding is consistent with the
SD-IR spectra shown in Figure 1b.
In the PLS-DA model, normal controls and COVID-19
samples were labeled 0and 1, respectively. The receiver
operating characteristic (ROC) curve was generated by
disturbing the decision threshold. The area under the curve
(AUC) value of the model was 0.9947 (95% CI 0.9769
0.9985). From the model output shown in Figure 3b, we
observed a distinct separation between the two classes, despite
six sample spectra which were from patient NS348 and NS457
mixed with controls, that is, when screening COVID-19
patients, especially the ones with moderate symptoms, such a
simple model based on a single latent variable may not be
eective enough. Nevertheless, SD-IR spectra coupled with
PLS-DA analysis achieve high sensitivity and specicity when
choosing proper decision thresholds, at least for the training
data in this work.
Dierentiate COVID-19 from Normal Controls, Res-
piratory Viral Infections, and Inammation-Related
Diseases. Patients with inuenza A/inuenza B/RSV
infections or common inammatory diseases were enrolled in
as interferences to further assess the performance of the
proposed method. We rst applied a database searching
algorithm based on Pearsons correlation coecient.
45
Despite
the high sensitivity for the controls, the overall performance is
limited (Table S4). PCA was not satisfactory either (not
shown), three PCs explaining about 5055% of the data
variance when using dierent spectral windows. The above-
mentioned results highlight the risks of misclassication using
unsupervised methods.
Then, a triple-class PLS-DA model was established to
dierentiate the following three groups: normal controls,
COVID-19, and other diseases. The spectral range between
900 and 1700 cm1achieved the best results. Three most
frequent cross-validation (CV) methods, namely, leave-one-out
Figure 3. Results of the PLS-DA model for classication between COVID-19 and control groups. (a) Regression vector with respect to the spectral
region of 9001700 cm1. VIP values in dierent ranges are shown in dierent colors. VIP = variable inuence on projection. (b) ROC plot of
COVID-19 samples (outer) and the model output (inner). Two threshold values corresponding to two points in the ROC curve were selected.
AUC = area under curve. CI = condence interval.
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2195
(LOO), 7-fold, and 10-fold were used to determine the
optimum number of latent variables by means of the prediction
error rate.
30,46
When applying 7-fold or 10-fold CV, the data
were randomly rearranged, which was repeated 100 times to
obtain statistical results. Clearly, three methods achieved
consistent prediction error rates, indicating good model
predictability (Figure 4a). In view of the model complexity,
nally, ve LVs was selected. Fewer variables are not adequate,
whereas more variables contribute little to the prediction
capability.
The PLS-DA model with ve LVs using the whole dataset
was then applied to produce the ROC curves (Figure 4b).
Here, the model output is a matrix of n×3, each column
corresponding to one class. By disturbing the decision
threshold, we got one ROC curve for each class. This helps
illustrate how one class is dierentiated from others. As shown
in Figure 4b, the control group achieves the highest AUC value
of 0.9994 (95% CI: 0.99701.0000), which suggests that
normal controls are nearly not likely to be identied as patients
by the model and vice versa. The AUC values for COVID-19
and other diseases are 0.9561 (95% CI: 0.90710.9774) and
0.9588 (95% CI: 0.93270.9752), respectively. The sensitivity
and specicity of COVID-19 identication can be adjusted by
modifying the decision thresholds. For instance, when it was
set as 0.288 (threshold 1, Figure 4b), a high sensitivity of 87%
was achieved, whereas a high specicity of 98% was fullled
when it was set as 0.383 (threshold 2, Figure 4b, sensitivity:
83.1%). Overall, threshold 1 is more acceptable. We found that
nonsevere patients NS348 and NS457 were still mixed with
others (Figure 4b, inner graph). Finally, VIP scores were
calculated to inspect the most signicant peaks for
classication. As indicated from Figure 4c, bands in spectral
ranges of 14501650 and 10501100 cm1should be paid
attention to. Of a simple note, inammatory markers may not
be likely to be distinguished by infrared spectroscopy because
of their relatively low levels (<1 mg/mL, or even lower than
ng/mL, Table S2). We may conclude that the discriminatory
bands may arise from subtle changes in proteins or lipids or
other constituents in serum.
DISCUSSION
The selection of biospecimens is necessary to be discussed. In
this study, we chose serum because of its relatively stable
components, thus minimizing the individual dierences caused
by disease-irrelevant factors. However, other types of speci-
mens also deserve trying, for example, white blood cells.
COVID-19 patients, especially for severe cases, are reported
with signicantly lower total lymphocytes, CD4+T cells, CD8+
T cells, B cells, and NK cells (P < 0.001),
47
as well as reduced
percentages of monocytes, eosinophils, and basophils.
48,49
Virus-containing samples such as nasal and throat swabs may
be feasible too.
As for ATRFT-IR measurements, several factors may
inuence the qualities of the spectra. For instance, hemolysis
caused by improper sampling might lead to spectral distortion
and impede correct identication (data not shown). It is
known that unfolding, conformational changes, and denatura-
tion may happen in proteins when exposed to elevated
temperatures.
50,51
For the samples collected, preheating at 56
°C for 30 min before measurement contributes to minor
spectral dierences in most cases, whereas it may lead to
relatively large variations in very few cases. The inuence of the
thermal factor on serum components and the corresponding
infrared spectra is worthwhile to further investigate. Never-
Figure 4. PLS-DA model performances for the triple-class classication. (a) Prediction error rates as a function of the number of latent variables.
For 7-fold and 10-fold cross-validation, the error bars were presented. (b) ROC graphs for each group. The inner graph shows the model predicted
output of the COVID-19 class. The decision threshold values of 1 and 2 are 0.288 and 0.383, respectively. (c) VIP scores for each class. Signicant
peaks were labeled.
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2196
theless, the IgG secondary structure seems to be stable within
temperatures between 20 and 55 °C.
51
Of another note, when
considering the practical operation, protein coagulation
induced by heat may make it harder for uniform sampling.
Immune responses to infection can contribute to alterations
in serum constituents, and several biomarkers have been
suggested to identify certain infections. For example,
procalcitonin (PCT), circulating cytokines (interferon [IL]-
1β, IL-6, IL-18, etc.), and acute-phase proteins (C-reactive
protein [CRP], ferritin, etc.) have been used to dierentiate
between bacterial and viral infection.
52,53
However, it is not
always an easy task to nd specic markers for a given viral
infection. Over the past several months, proteomic and
metabolomic proles of peripheral blood samples have been
investigated to correlate disease severity of COVID-19 with
certain proteins and lipids.
54,55
Details can be found elsewhere.
We know that FT-IR also provides hints toward proteins,
lipids, and other constituents in sera, but in a more
comprehensive and macroscopic view. Provided that human
serum is dominated by proteins and proteins are dominated by
albumin and Ig (Table S2, see also Figure 1b), we can
postulate that alterations in the amide I are closely related to
the two types of proteins. The increases in Ig levels and
decreases in albumin levels in the sera of patients with
COVID-19, revealed by the SD-IR spectra in Figure 1b, have
been proved by a lot of reports. The IgG level in the patient
sera elevates several times after the onset of COVID-19.
56
A
report by Long et al. showed that 19.5% (8/41) of the patients
with COVID-19 have a fourfold increase in the IgG titers.
9
The decreased albumin level was also reported by several
reports.
54,57
The reduction in the albumin level is related to the
acute phase response (APR) of patients to viral infection. The
hepatic synthesis of proteins is drastically regulated during the
acute phase of illnesses such as infection, tissue injury,
neoplastic growth, or immunological disorders, usually with
increased C-reactive protein (CRP) and serum amyloid A and
decreased transferrin and albumin.
58
The dysregulation of
phospholipids in patients with COVID-19 has also been
indicated by the SD-IR spectra shown in Figure 1b, which has
been discussed earlier (Results, part one).
More detailed interpretation of the infrared spectra and the
corresponding spectral dierences still lack a lot of knowledge.
The multivariate and statistical analysis provided chances for
the evaluation of COVID-19 diagnosis by infrared spectrosco-
py. In summary, the second-derivative spectrum can increase
the separation of the overlapping bands and thus is more
powerful than the original spectrum. We show that both
unsupervised and supervised methods are promising to
dierentiate COVID-19 patients from healthy controls.
Nonsevere and severe patients can be separated by
unsupervised methods such as PCA and HCA. However, as
indicated by this work, asymptomatic patients or newly
diagnosed patients with low IgG levels may not be correctly
identied. In this case, the decision threshold of PLS-DA
should be decided with caution. Lower threshold values yield
higher sensitivity but lower specicity.
For the purpose of diagnosis, which is no doubt tougher
than the only discrimination between COVID-19 and controls,
common respiratory viral infections or inammation were
considered as interferences. Unsupervised methods were not
so eective, whereas the PLS-DA model yielded good
performances. For a given threshold, the model achieved a
sensitivity of 83.1% and specicity of 98%. Bands in spectral
ranges of 14501650 and 10501100 cm1were the most
responsible for the model discrimination capability. Further
illustration is required in future work.
CONCLUSIONS
Taking the time cost, operation complexity, and detection
performance into consideration, FTIR coupled with multi-
variate analysis is a feasible tool for screening and primary
diagnosis of COVID-19. Several serum constituents including,
but not just, antibodies and serum lipids could be reected on
the infrared spectra, serving as chemical ngerprintsand
accounting for good model performances. It was also showed
that this assay exhibited high identication specicity for
COVID-19 under the intervention from common respiratory
viruses and inammation-related diseases.
In terms of practical use, more clinical data are required to
generalize the model, and special attention should be paid to
the asymptomatic patients. Nevertheless, infrared spectroscopy
can serve as an assistant diagnosis tool as a supplement to in-
use techniques. The feasibility of other common specimens
such as oropharyngeal swabs, identication and interpretation
of the spectral markers, the potentials of FTIR spectroscopy on
antibody response monitoring, and COVID-19 severity
prediction require further investigations.
ASSOCIATED CONTENT
*
sıSupporting Information
The Supporting Information is available free of charge at
https://pubs.acs.org/doi/10.1021/acs.analchem.0c04049.
Patients with inammatory diseases included in this
study; molar and mass concentrations of major
constituents in normal human serum/plasma; second-
derivative spectra: band assignments and statistical
comparisons between COVID-19 and control serum in
band locations and relative absorbance; performance of
the database searching algorithm using dierent spectral
regions; original and the corresponding second-deriva-
tive infrared spectra from healthy controls (number of
spectra = 51) and nonsevere (n= 18) and severe (n=
59) COVID-19 patients; box plot of relative absorbance
(to amide I) of the existing bands in the original spectra;
box plot of absorbance of the existing bands in the SD-
IR spectra; and three-principal-component score plot of
PCA results (PDF)
AUTHOR INFORMATION
Corresponding Authors
Haiyun Luo Department of Electrical Engineering, Tsinghua
University, Beijing 100084, China; Email: lhy@
tsinghua.edu.cn
Qun Zhou Department of Chemistry, Key Laboratory of
Bioorganic Phosphorus Chemistry & Chemical Biology
(Ministry of Education), Tsinghua University, Beijing
100084, China; Email: zhouqun@tsinghua.edu.cn
Yingchun Xu Department of Laboratory Medicine, Peking
Union Medical College Hospital, Peking Union Medical
College, Chinese Academy of Medical Sciences, Beijing
100730, China; Email: xycpumch@139.com
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2197
Authors
Liyang Zhang Department of Electrical Engineering,
Tsinghua University, Beijing 100084, China; orcid.org/
0000-0001-9909-9263
Meng Xiao Department of Laboratory Medicine, Peking
Union Medical College Hospital, Peking Union Medical
College, Chinese Academy of Medical Sciences, Beijing
100730, China
Yao Wang Department of Laboratory Medicine, Peking
Union Medical College Hospital, Peking Union Medical
College, Chinese Academy of Medical Sciences, Beijing
100730, China
Siqi Peng Department of Electrical Engineering, Tsinghua
University, Beijing 100084, China; orcid.org/0000-0001-
5418-3656
Yu Chen Department of Laboratory Medicine, Peking Union
Medical College Hospital, Peking Union Medical College,
Chinese Academy of Medical Sciences, Beijing 100730, China
Dong Zhang Department of Laboratory Medicine, Peking
Union Medical College Hospital, Peking Union Medical
College, Chinese Academy of Medical Sciences, Beijing
100730, China
Dongheyu Zhang Department of Electrical Engineering,
Tsinghua University, Beijing 100084, China
Yuntao Guo Department of Electrical Engineering, Tsinghua
University, Beijing 100084, China
Xinxin Wang Department of Electrical Engineering,
Tsinghua University, Beijing 100084, China
Complete contact information is available at:
https://pubs.acs.org/10.1021/acs.analchem.0c04049
Author Contributions
L.Z., M.X., and Y.W. contributed equally.
Notes
The authors declare no competing nancial interest.
ACKNOWLEDGMENTS
This work was funded by the National Natural Science
Foundation of China (52041701), the Tsinghua University-
Peking Union Medical College Hospital Initiative Scientic
Research Program (20191080604), Tsinghua University
Spring Breeze Fund (2020Z99CFG007), and Beijing Nova
Program (Z201100006820127).
REFERENCES
(1) WHO, Coronavirus disease (COVID-2019) situation reports.
https://www.who.int/emergencies/diseases/novel-coronavirus-2019/
situation-reports/ (accessed June 1, 2020), June 1, 2020.
(2) Zhou, P.; Yang, X.-L.; Wang, X.-G.; et al. Nature 2020,579,
270273.
(3) Yang, Y.; Yang, M.; Yuan, J.; Wang, F.; Wang, Z.; Li, J.; Zhang,
M.; Xing, L.; Wei, J.; Peng, L.; Wong, G.; Zheng, H.; Wu, W.; Shen,
C.; Liao, M.; Feng, K.; Li, J.; Yang, Q.; Zhao, J.; Liu, L.; et al. The
Innovation 2020,1, No. 100061.
(4) Ai, T.; Yang, Z.; Hou, H.; et al. Radiology 2020,296, E32E40.
(5) Wölfel, R.; Corman, V. M.; Guggemos, W.; et al. Nature 2020,
581, 465469.
(6) Young, B. E.; Ong, S. W. X.; Kalimuddin, S.; et al. JAMA, J. Am.
Med. Assoc. 2020,323, 14881494.
(7) Huang, C.; Wang, Y.; Li, X.; et al. Lancet 2020,395, 497506.
(8) Li, X.; Geng, M.; Peng, Y.; Meng, L.; Lu, S. J. Pharm. Anal. 2020,
10, 102108.
(9) Long, Q.-X.; Liu, B.-Z.; Deng, H.-J.; et al. Nat. Med. 2020,26,
845848.
(10) Guo, L.; Ren, L.; Yang, S.; et al. Clin. Infect. Dis. 2020,71, 778
785.
(11) Li, Z.; Yi, Y.; Luo, X.; et al. J. Med. Virol. 2020,92, 15181524.
(12) Zhao, J.; Yuan, Q.; Wang, H.; et al. Clin. Infect. Dis. 2020,71,
2027.
(13) Santos, M. C. D.; Nascimento, Y. M.; Araújo, J. M. G.; Lima, K.
M. G. RSC Adv. 2017,7, 2564025649.
(14) Roy, S.; Perez-Guaita, D.; Bowden, S.; Heraud, P.; Wood, B. R.
Clinical Spectroscopy 2019,1, 100001.
(15) Sitole, L.; Steffens, F.; Krüger, T. P. J.; Meyer, D. OMICS 2014,
18, 513523.
(16) Sakudo, A.; Tsenkova, R.; Onozuka, T.; et al. Microbiol.
Immunol. 2005,49, 695701.
(17) Salman, A.; Erukhimovitch, V.; Talyshinsky, M.; Huleihil, M.;
Huleihel, M. Biopolymers 2002,67, 406412.
(18) Erukhimovitch, V.; Talyshinsky, M.; Souprun, Y.; Huleihel, M.
FTIR Microscopy Detection of Cells Infected With Viruses. In DNA
Viruses: Methods and Protocols; Lieberman, P. M., Ed.; Humana Press:
Totowa, NJ, 2005; Vol. 292, p 161.
(19) Santos, M. C. D.; Nascimento, Y. M.; Monteiro, J. D.; et al.
Anal. Methods 2018,10, 12801285.
(20) Agbaria, A. H.; Beck Rosen, G.; Lapidot, I.; et al. Anal. Chem.
2018,90, 78887895.
(21) Eccles, R. Lancet Infect. Dis. 2005,5, 718725.
(22) Falsey, A. R.; Walsh, E. E. Clin. Microbiol. Rev. 2000,13, 371
384.
(23) Guan, W. J.; Ni, Z. Y.; Hu, Y.; et al. N. Engl. J. Med. 2020,382,
17081720.
(24) China National Guidelines for Diagnosis and Treatment of
Corona Virus Disease 2019 (COVID-19). http://www.nhc.gov.cn/
yzygj/s7653p/202002/d4b895337e19445f8d728fcaf1e3e13a.shtml
(accessed Feb 15, 2020), Feb 8, 2020.
(25) Zhang, X.; Guo, Y.; Song, Y.; et al. Proteomics 2006,6, 5260
5268.
(26) Agbaria, A. H.; Beck Rosen, G.; Lapidot, I.; et al. Anal. Chem.
2018,90, 78887895.
(27) Wenning, M.; Scherer, S. Appl. Microbiol. Biotechnol. 2013,97,
71117120.
(28) Zarnowiec, P.; Lechowicz, L.; Czerwonka, G.; Kaca, W. Curr.
Med. Chem. 2015,22, 17101718.
(29) Carneiro, C. R.; Silva, C. S.; de Carvalho, M. A.; Pimentel, M.
F.; Talhavini, M.; Weber, I. T. Anal. Chem. 2019,91, 1244412452.
(30) Lee, L. C.; Liong, C.-Y.; Jemain, A. A. Analyst 2018,143,
35263539.
(31) Gromski, P. S.; Muhamadali, H.; Ellis, D. I.; et al. Anal. Chim.
Acta 2015,879,1023.
(32) Mehmood, T.; Liland, K. H.; Snipen, L.; Sæbø, S. Chemom.
Intell. Lab. Syst. 2012,118,6269.
(33) Dé
lé
ris, G.; Petibois, C. Vib. Spectrosc. 2003,32, 129136.
(34) Naumann, D. Appl. Spectrosc. Rev. 2001,36, 239298.
(35) Anderson, N. L.; Anderson, N. G. Mol. Cell. Proteomics 2002,1,
845867.
(36) He, X. M.; Carter, D. C. Nature 1992,358, 209215.
(37) Buijs, J.; Norde, W.; Lichtenbelt, J. W. T. Langmuir 1996,12,
16051613.
(38) Schroeder, H. W.; Cavacini, L. J. Allergy Clin. Immunol. 2010,
125, S41S52.
(39) Williams, J. H.; Kuchmak, M.; Witter, R. F. Lipids 1966,1,89
97.
(40) Hidaka, H.; Yamauchi, K.; Ohta, H.; Akamatsu, T.; Honda, T.;
Katsuyama, T. Clin. Biochem. 2008,41, 12111217.
(41) Wu, D.; Shu, T.; Yang, X.; et al. Natl. Sci. Rev. 2020,7, 1157
1168.
(42) Liu, K.-Z.; Shaw, R. A.; Man, A.; Dembinski, T. C.; Mantsch, H.
H. Clin. Chem. 2002,48, 499506.
(43) Triba, M. N.; Le Moyec, L.; Amathieu, R.; et al. Mol. BioSyst.
2015,11,1319.
(44) Galindo-Prieto, B.; Eriksson, L.; Trygg, J. J. Chemom. 2014,28,
623632.
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2198
(45) Primpke, S.; Lorenz, C.; Rascher-Friesenhausen, R.; Gerdts, G.
Anal. Methods 2017,9, 14991511.
(46) Liu, W.; Sun, Z.; Chen, J.; Jing, C. J. Spectrosc. 2016,2016,
1603609.
(47) Wang, F.; Nie, J.; Wang, H.; et al. J. Infect. Dis. 2020,221,
17621769.
(48) Qin, C.; Zhou, L.; Hu, Z.; et al. Clin. Infect. Dis. 2020,71, 762
768.
(49) Li, Q.; Ding, X.; Xia, G.; et al. EClinicalMedicine 2020,23,
100375.
(50) Huggins, C.; Jensen, E. V. J. Biol. Chem. 1949,179, 645654.
(51) Vermeer, A. W. P.; Norde, W. Biophys. J. 2000,78, 394404.
(52) Chalupa, P.; Beran, O.; Herwald, H.; Kaspř
íková
, N.; Holub, M.
Infection 2011,39, 411417.
(53) Slaats, J.; ten Oever, J.; van de Veerdonk, F. L.; Netea, M. G.
PLoS Pathog. 2016,12, e1005973.
(54) Shen, B.; Yi, X.; Sun, Y.; et al. Cell 2020,182,5972.
(55) Whetton, A. D.; Preston, G. W.; Abubeker, S.; Geifman, N. J.
Proteome Res. 2020,19, 42194232.
(56) Zhang, B.; Zhou, X.; Zhu, C.; et al. Front. Mol. Biosci. 2020,7,
157.
(57) Chen, N.; Zhou, M.; Dong, X.; et al. Lancet 2020,395, 507
513.
(58) Heinrich, P. C.; Castell, J. V.; Andus, T. Biochem. J. 1990,265,
621636.
Analytical Chemistry pubs.acs.org/ac Article
https://dx.doi.org/10.1021/acs.analchem.0c04049
Anal. Chem. 2021, 93, 21912199
2199
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
The pandemic of the coronavirus disease 2019 (COVID-19) has become a global public health crisis. The symptoms of COVID-19 range from mild to severe, but the physiological changes associated with COVID-19 are barely understood. In this study, we performed targeted metabolomic and lipidomic analyses of plasma from a cohort of patients with COVID-19 who had experienced different symptoms. We found that metabolite and lipid alterations exhibit apparent correlation with the course of disease in these patients, indicating that the development of COVID-19 affected their whole-body metabolism. In particular, malic acid of the TCA cycle and carbamoyl phosphate of the urea cycle result in altered energy metabolism and hepatic dysfunction, respectively. It should be noted that carbamoyl phosphate is profoundly down-regulated in patients who died compared with patients with mild symptoms. And, more importantly, guanosine monophosphate (GMP), which is mediated not only by GMP synthase but also by CD39 and CD73, is significantly changed between healthy subjects and patients with COVID-19, as well as between the mild and fatal cases. In addition, dyslipidemia was observed in patients with COVID-19. Overall, the disturbed metabolic patterns have been found to align with the progress and severity of COVID-19. This work provides valuable knowledge about plasma biomarkers associated with COVID-19 and potential therapeutic targets, as well as an important resource for further studies of the pathogenesis of COVID-19.
Article
Full-text available
Introduction: A recently emerging respiratory disease named coronavirus disease 2019 (COVID-19) has quickly spread across the world. This disease is initiated by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) and uncontrolled cytokine storm, but it remains unknown as to whether a robust antibody response is related to clinical deterioration and poor outcome in COVID-19 patients. Methods: Anti-SARS-CoV-2 IgG and IgM antibodies were determined by chemiluminescence analysis (CLIA) in COVID-19 patients at a single center in Wuhan. Median IgG and IgM levels in acute and convalescent-phase sera (within 35 days) for all included patients were calculated and compared between severe and non-severe patients. Immune response phenotyping based on the late IgG levels and neutrophil-to-lymphocyte ratio (NLR) was characterized to stratified patients into different disease severities and outcomes. Results: A total of 222 patients were included in this study. IgG was first detected on day 4 of illness, and its peak levels occurred in the fourth week. Severe cases were more frequently found in patients with high IgG levels, compared to those with low IgG levels (51.8 vs. 32.3%; p = 0.008). Severity rates for patients with NLRhiIgGhi, NLRhiIgGlo, NLRloIgGhi, and NLRloIgGlo phenotype were 72.3, 48.5, 33.3, and 15.6%, respectively (p < 0.0001). Furthermore, severe patients with NLRhiIgGhi, NLRhiIgGlo had higher inflammatory cytokines levels including IL-2, IL-6 and IL-10, and decreased CD4+ T cell count compared to those with NLRloIgGlo phenotype (p < 0.05). Recovery rates for severe patients with NLRhiIgGhi, NLRhiIgGlo, NLRloIgGhi, and NLRloIgGlo phenotype were 58.8% (20/34), 68.8% (11/16), 80.0% (4/5), and 100% (12/12), respectively (p = 0.0592). Dead cases only occurred in NLRhiIgGhi and NLRhiIgGlo phenotypes. Conclusions: COVID-19 severity is associated with increased IgG response, and an immune response phenotyping based on the late IgG response and NLR could act as a simple complementary tool to discriminate between severe and non-severe COVID-19 patients, and further predict their clinical outcome.
Article
Full-text available
Background Coronavirus disease (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is currently a pandemic affecting over 200 countries. Many cities have established designated fever clinics to triage suspected COVID-19 patients from other patients with similar symptoms. However, given the limited availability of the nucleic acid test as well as long waiting time for both the test and radiographic examination, the quarantine or therapeutic decisions for a large number of mixed patients were often not made in time. We aimed to identify simple and quickly available laboratory biomarkers to facilitate effective triage at the fever clinics for sorting suspected COVID-19 patients from those with COVID-19-like symptoms. Methods We collected clinical, etiological, and laboratory data of 989 patients who visited the Fever Clinic at Wuhan Union Hospital, Wuhan, China, from Jan 31 to Feb 21. Based on polymerase chain reaction (PCR) nucleic acid testing for SARS-CoV-2 infection, they were divided into two groups: SARS-CoV-2-positive patients as cases and SARS-CoV-2-negative patients as controls. We compared the clinical features and laboratory findings of the two groups, and analyzed the diagnostic performance of several laboratory parameters in predicting SARS-CoV-2 infection and made relevant comparisons to the China diagnosis guideline of having a normal or decreased number of leukocytes (≤9•5 10⁹/L) or lymphopenia (<1•1 10⁹/L). Findings Normal or decreased number of leukocytes (≤9•5 10⁹/L), lymphopenia (<1•1 10⁹/L), eosinopenia (<0•02 10⁹/L), and elevated hs-CRP (≥4 mg/L) were presented in 95•0%, 52•2%, 74•7% and 86•7% of COVID-19 patients, much higher than 87•2%, 28•8%, 31•3% and 45•2% of the controls, respectively. The eosinopenia produced a sensitivity of 74•7% and specificity of 68•7% for separating the two groups with the area under the curve (AUC) of 0•717. The combination of eosinopenia and elevated hs-CRP yielded a sensitivity of 67•9% and specificity of 78•2% (AUC=0•730). The addition of eosinopenia alone or the combination of eosinopenia and elevated hs-CRP into the guideline-recommended diagnostic parameters for COVID-19 improved the predictive capacity with higher than zero of both net reclassification improvement (NRI) and integrated discrimination improvement (IDI). Interpretation The combination of eosinopenia and elevated hs-CRP can effectively triage suspected COVID-19 patients from other patients attending the fever clinic with COVID-19-like initial symptoms. This finding would be particularly useful for designing triage strategies in an epidemic region having a large number of patients with COVID-19 and other respiratory diseases while limited medical resources for nucleic acid tests and radiographic examination. Funding This work was supported by the National Natural Science Foundation of China (NSFC) and the Major Scientific and Technological Innovation Projects of Hubei Province (MSTIP).
Article
Full-text available
We report acute antibody responses to SARS-CoV-2 in 285 patients with COVID-19. Within 19 days after symptom onset, 100% of patients tested positive for antiviral immunoglobulin-G (IgG). Seroconversion for IgG and IgM occurred simultaneously or sequentially. Both IgG and IgM titers plateaued within 6 days after seroconversion. Serological testing may be helpful for the diagnosis of suspected patients with negative RT–PCR results and for the identification of asymptomatic infections. A cross-sectional study of hospitalized patients with COVID-19 and a longitudinal follow-up study of patients with COVID-19 suggest that SARS-CoV2-specific IgG or IgM seroconversion occurs within 20 days post symptom onset.
Article
Full-text available
Coronavirus disease 2019 (COVID-19) is an acute respiratory tract infection that emerged in late 20191,2. Initial outbreaks in China involved 13.8% cases with severe, and 6.1% with critical courses³. This severe presentation corresponds to the usage of a virus receptor that is expressed predominantly in the lung2,4. By causing an early onset of severe symptoms, this same receptor tropism is thought to have determined pathogenicity, but also aided the control, of severe acute respiratory syndrome (SARS) in 2003⁵. However, there are reports of COVID-19 cases with mild upper respiratory tract symptoms, suggesting the potential for pre- or oligosymptomatic transmission6–8. There is an urgent need for information on body site-specific virus replication, immunity, and infectivity. Here we provide a detailed virological analysis of nine cases, providing proof of active virus replication in upper respiratory tract tissues. Pharyngeal virus shedding was very high during the first week of symptoms (peak at 7.11 × 10⁸ RNA copies per throat swab, day 4). Infectious virus was readily isolated from throat- and lung-derived samples, but not from stool samples, in spite of high virus RNA concentration. Blood and urine never yielded virus. Active replication in the throat was confirmed by viral replicative RNA intermediates in throat samples. Sequence-distinct virus populations were consistently detected in throat and lung samples from the same patient, proving independent replication. Shedding of viral RNA from sputum outlasted the end of symptoms. Seroconversion occurred after 7 days in 50% of patients (14 days in all), but was not followed by a rapid decline in viral load. COVID-19 can present as a mild upper respiratory tract illness. Active virus replication in the upper respiratory tract puts the prospects of COVID-19 containment in perspective.
Article
Full-text available
Background: The novel coronavirus SARS-CoV-2 is a newly emerging virus. The antibody response in infected patient remains largely unknown, and the clinical values of antibody testing have not been fully demonstrated. Methods: A total of 173 patients with SARS-CoV-2 infection were enrolled. Their serial plasma samples (n=535) collected during the hospitalization were tested for total antibodies (Ab), IgM and IgG against SARS-CoV-2. The dynamics of antibodies with the disease progress was analyzed. Results: Among 173 patients, the seroconversion rate for Ab, IgM and IgG was 93.1%, 82.7% and 64.7%, respectively. The reason for the negative antibody findings in 12 patients might due to the lack of blood samples at the later stage of illness. The median seroconversion time for Ab, IgM and then IgG were day-11, day-12 and day-14, separately. The presence of antibodies was <40% among patients within 1-week since onset, and rapidly increased to 100.0% (Ab), 94.3% (IgM) and 79.8% (IgG) since day-15 after onset. In contrast, RNA detectability decreased from 66.7% (58/87) in samples collected before day-7 to 45.5% (25/55) during day 15-39. Combining RNA and antibody detections significantly improved the sensitivity of pathogenic diagnosis for COVID-19 (p<0.001), even in early phase of 1-week since onset (p=0.007). Moreover, a higher titer of Ab was independently associated with a worse clinical classification (p=0.006). Conclusions: The antibody detection offers vital clinical information during the course of SARS-CoV-2 infection. The findings provide strong empirical support for the routine application of serological testing in the diagnosis and management of COVID-19 patients.
Article
Full-text available
Background: Emergence of coronavirus disease 2019 (COVID-19) is a major healthcare threat. Current method of detection involves qPCR-based technique, which identifies the viral nucleic acids when present in sufficient quantity. False negative results can be achieved and failure to quarantine the infected patient would be a major setback in containing the viral transmission. We here aim to describe the time kinetics of various antibodies produced against the 2019 novel coronavirus (SARS-CoV-2) and evaluate the potential of antibody testing to diagnose COVID-19. Methods: The host humoral response against SARS-CoV-2 including IgA, IgM and IgG response were examined by using an ELISA based assay on the recombinant viral nucleocapsid protein. Total 208 plasma samples were collected from 82 confirmed and 58 probable cases (qPCR negative but had typical manifestation). The diagnostic value of IgM was evaluated in this cohort. Results: The median duration of IgM and IgA antibody detection were 5 days (IQR 3-6), while IgG was detected on 14 days (IQR 10-18) after symptom onset, with a positive rate of 85.4%, 92.7% and 77.9% respectively. In confirmed and probable cases, the positive rates of IgM antibodies were 75.6% and 93.1%, respectively. The detection efficiency by IgM ELISA is higher than that of qPCR method after 5.5 days of symptom onset. The positive detection rate is significantly increased (98.6%) when combined IgM ELISA assay with PCR for each patient compare with a single qPCR test (51.9%). Conclusions: Humoral response to SARS-CoV-2 can aid to the diagnosis of COVID-19, including subclinical cases.
Article
The emergence of novel coronavirus disease 2019 (COVID-19), caused by the SARS-CoV-2 coronavirus, has necessitated the urgent development of new diagnostic and therapeutic strategies. Rapid research and development, on an international scale, has already generated assays for detecting SARS-CoV-2 RNA and host immunoglobulins. However, the complexities of COVID-19 are such that a fuller definition of patient status, trajectory, sequelae and responses to therapy is now required. There is accumulating evidence – from studies of both COVID-19 and the related disease SARS – that protein biomarkers could help to provide this definition. Proteins associated with blood coagulation (D-dimer), cell damage (lactate dehydrogenase) and the inflammatory response (e.g. C-reactive protein) have already been identified as possible predictors of COVID-19 severity or mortality. Proteomics technologies, with their ability to detect many proteins per analysis, have begun to extend these early findings. In order to be effective, proteomics strategies must include not only methods for comprehensive data acquisition (e.g. using mass spectrometry) but also informatics approaches via which to derive actionable information from large data-sets. Here we review applications of proteomics to COVID-19 and SARS, and outline how pipelines involving technologies such as artificial intelligence could be of value for research on these diseases.
Article
Early detection and effective treatment of severe COVID-19 patients remain major challenges. Here, we performed proteomic and metabolomic profiling of sera from 46 COVID-19 and 53 control individuals. We then trained a machine learning model using proteomic and metabolomic measurements from a training cohort of 18 non-severe and 13 severe patients. The model was validated using ten independent patients, seven of which were correctly classified. Targeted proteomics and metabolomics assays were employed to further validate this molecular classifier in a second test cohort of 19 new COVID-19 patients, leading to 16 correct assignments. We identified molecular changes in the sera of COVID-19 patients compared to other groups implicating dysregulation of macrophage, platelet degranulation and complement system pathways, and massive metabolic suppression. This study revealed characteristic protein and metabolite changes in the sera of severe COVID-19 patients, which might be used in selection of potential blood biomarkers for severity evaluation.
Article
Background: Since December 2019, novel coronavirus (SARS-CoV-2)-infected pneumonia (COVID-19) occurred in Wuhan, and rapidly spread throughout China. We aimed to clarify the characteristics and clinical significance of peripheral lymphocyte subset alteration in COVID-19. Methods: The levels of peripheral lymphocyte subsets were measured by flow cytometry in 60 hospitalized COVID-19 patients before and after treatment, and their association with clinical characteristics and treatment efficacy was analyzed. Results: Total lymphocytes, CD4+ T cells, CD8+ T cells, B cells and natural killer (NK) cells decreased in COVID-19 patients, and severe cases had a lower level than mild cases. The subsets showed a significant association with the inflammatory status in COVID-19, especially CD8+ T cells and CD4+/CD8+ ratio. After treatment, 37 patients (67%) reached clinical response, with an increase of CD8+ T cells and B cells. No significant change of any subset was detected in non-response cases. In multivariate analysis, post-treatment decrease of CD8+ T cells and B cells and increase of CD4+/CD8+ ratio were indicated as independent predictors for poor efficacy. Conclusions: Peripheral lymphocyte subset alteration was associated with the clinical characteristics and treatment efficacy of COVID-19. CD8+ T cells tended to be an independent predictor for COVID-19 severity and treatment efficacy.