Skip to main content

Using time-series chest radiographs and laboratory data by machine learning for identifying pulmonary infection and colonization of Acinetobacter baumannii

Abstract

Background

Accurately distinguishing between pulmonary infection and colonization in patients with Acinetobacter baumannii is of utmost importance to optimize treatment and prevent antibiotic abuse or inadequate therapy. An efficient automated sorting tool could prompt individualized interventions and enhance overall patient outcomes. This study aims to develop a robust machine learning classification model using a combination of time-series chest radiographs and laboratory data to accurately classify pulmonary status caused by Acinetobacter baumannii.

Methods

We proposed nested logistic regression models based on different time-series data to automatically classify the pulmonary status of patients with Acinetobacter baumannii. Advanced features were extracted from the time-series data of hospitalized patients, encompassing dynamic pneumonia indicators observed on chest radiographs and laboratory indicator values recorded at three specific time points.

Results

Data of 152 patients with Acinetobacter baumannii cultured from sputum or alveolar lavage fluid were retrospectively analyzed. Our model with multiple time-series data demonstrated a higher performance of AUC (0.850, with a 95% confidence interval of [0.638–0.873]), an accuracy of 0.761, a sensitivity of 0.833. The model, which only incorporated a single time point feature, achieved an AUC of 0.741. The influential model variables included difference in the chest radiograph pneumonia score.

Conclusion

Dynamic assessment of time-series chest radiographs and laboratory data using machine learning allowed for accurate classification of colonization and infection with Acinetobacter baumannii. This demonstrates the potential to help clinicians provide individualized treatment through early detection.

Background

Acinetobacter baumannii (A. baumannii) is a significant nosocomial pathogen and a growing public health concern [1]. It has emerged as a frequent causative agent of lower respiratory tract infections in critically ill patients, known for its rapid acquisition of drug resistance and pan-drug-resistant phenotypes [2,3,4]. Strains of this bacterial species can cause lower respiratory tract infection while also colonizing the region asymptomatically [5,6,7,8]. Identifying the presence of colonization and infection is crucial for guiding initial antibiotic therapy and the implementation of isolation measures in order to prevent transmission of A. baumannii [9, 10]. Nevertheless, it is challenging to distinguish between the two. It is not always clear whether the patient is being colonized or infected at any given time, particularly in those who are immunosuppressed or at high risk, such as transplant patients, those with malignancies, or those receiving corticosteroids, owing to a suppressed inflammatory response [11].

Previous published studies have employed retrospective or prospective research methods to analyze the risk factors associated with colonization or infection [12,13,14,15,16]. Most studies indicated that parameters associated with infection by A. baumannii are the admission at ICU, the number of days of hospitalization, mechanical ventilation and antibiotic treatment. However, previous published studies have only analyzed data based on single-time clinical information when the strain was initially cultured. However, some clinical indicators of the hospitalized patients changed during this period, such as inflammatory indicators [17]. Dynamic vital laboratory data can provide valuable insights beyond individual data points. In addition, continuous observation of chest radiographs reflects changes in inflammation in the lungs [18]. Leveraging the full potential of these dynamic factors was expected to be advantageous for distinguishing between infection and colonization. There is a demand for a more objective, efficient, and intelligent approach to handling this time-dependent dataset with a linear relationship.

In recent years, the use of machine learning in healthcare has garnered increasing attention, particularly in areas such as lesion prediction, personalized patient treatment, and objective evaluation of patient conditions [19,20,21]. For instance, Wang at al. proposed an innovative Lasso Logistic Regression model that utilizes feature-based time series data to determine the optimal timing for drug administration or escalating intervention procedures in COVID-19 patients [19]. However, there have been limited studies focusing specifically on the application of machine learning in assessing the pulmonary status of A. baumannii. Therefore, the objective of this study is to effectively leverage the extensive data collected from various tests conducted during patient hospitalization to develop a machine learning classification model capable of accurately identifying A. baumannii pulmonary status. Additionally, we sought to extract key features from the time-series profiles of chest radiographs and laboratory data that significantly influenced the progression of infection.

Methods

Study design

Figure 1 depicts the study design. In this retrospective study, we examined the initial clinical characteristics, laboratory data, and chest radiograph data of patients with A. baumannii isolated from sputum or alveolar lavage fluid on more than two consecutive occasions. The study received approval from the Ethics Committee of Nanfang Hospital, and the need for informed consent was waived given the retrospective nature of the study. This study was conducted in compliance with the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) guidelines [22].

Fig. 1
figure 1

The overall design of the study. (a) Retrospective collection of baseline clinical information as well as time-series chest radiographs and laboratory indicators. Quantitative assessment of chest radiographs abnormalities, specifically through the quantitative scoring of chest radiographs by radiologists, aiming to detect and diagnose pneumonia more accurately. (b) We constructed four nested logistic regression models for classifying pulmonary infection and colonization of A. baumannii combined difference clinical characteristic. (c) Model performance was assessed using AUC, decision curve analysis and calibration curve. (d) We adopted Shapley additive explanation (SHAP) values to determine which features contributed most to model predictions on the logistic regression predictions. T1: within 1 day of admission; T2, 3 days before culturing out the strain; T3, 1 day within culturing out the strain. Model 1: clinical baseline information + laboratory indicators and radiographic features of T3. Model 2: model1 + the change value of between T3 and T1. Model 3: model 1 + the change value of between T3 and T2. Model 4: model 2 + model 3. ROC: receiver operating characteristic curves. DCA: decision curve analysis

Study participants and data collection

We retrospectively enrolled 152 patients with A. baumannii cultured in the sputum or alveolar lavage fluid more than two consecutive times. Diagnoses of A. baumannii infection or colonization were confirmed by infectious disease specialists based on “Consensus of the Chinese specialists for diagnosis, treatment & control of Acinetobacter baumannii infection.” [23].

Patients with lower respiratory tract chest abnormalities resulting from noninfectious causes such as pulmonary embolism, pulmonary edema, lung cancer, and other conditions were excluded. Patients whose strain culture was performed within 3 days of hospitalization were also excluded. Chest radiographs were obtained from these patients within 1 day of A. baumannii culture. All conventional radiographic images were obtained using the hospital’s picture archiving and communication system. Baseline clinical information and chest radiographs were collected to create a dedicated database.

Data related to various categories were collected for analysis. This included baseline clinical information such as demographic characteristics (age and sex), preoperative comorbidities, invasive procedures, use of Antibiotics and glucocorticoids, hospitalization details, and time-series data. The time-series data consisted of three phases: T1 (within 1 day of admission), T2 (3 days before strain culture), and T3 (within 1 day of strain culture). Time-series data included chest radiographs and laboratory test results for serum inflammatory indicators, liver function, kidney function, and electrolytes. The baseline data used for the analysis were obtained at T3. To compare changes over time, the difference between the baseline data and the other two time points was calculated for each time-series datapoint. Additionally, the Acute Physiology and Chronic Health Evaluation (APACHE) II score was calculated within 1 day of strain isolation. More detail sees supplemental material.

Imaging analysis: quantification of chest radiograph abnormalities

Chest radiographs were acquired through both portable and nonportable imaging devices, using both anteroposterior and posteroanterior projections. All time-series chest radiographs were independently and simultaneously reviewed by two experienced radiologists (J. Wu and J. Lin, with 9 and 15 years of diagnostic imaging experience, respectively) who were blinded to the clinical characteristics, laboratory findings, and patient outcomes. After independent evaluation, the radiologists resolved any disagreements with discussion and consensus. Serial chest radiographs were analyzed and compared to evaluate the progression, stability, or improvement of lung abnormalities throughout the course of the illness. For each patient diagnosed with pneumonia, the predominant chest radiographic features, as defined by the Fleischner Society Glossary [24] included ground glass opacity (GGO), consolidation, pleural thickening, adhesions, and pleural effusion. Additional information regarding the image interpretation process is available in the supplementary materials.

The severity score for each lung was calculated using the Radiographic Assessment of Lung Edema (RALE) score, as proposed by Warren et al. [25]. To quantitatively assess the extent of pulmonary abnormalities, such as ground-glass opacities (GGO) and consolidation, a chest radiograph score was assigned based on the involvement area in each of the six lung fields. The scoring criteria were as follows: 0 for no involvement, 1 for < 5% involvement, 2 for 5–25% involvement, 3 for 25–50% involvement, 4 for 50–75% involvement, and 5 for > 75% involvement. Additionally, a small amount of pleural effusion was given a score of 1, while a large pleural effusion and pleural thickening adhesions were assigned a score of 2. The total pneumonia severity score was calculated by summing up the scores of individual lung fields, resulting in a total score ranging from 0 to 34.

Machine learning model development and evaluation

Classification method. For model development, we used logistic regression, a machine learning algorithm. Logistic regression diagnostics assess the effectiveness of models in capturing the underlying associations between predictors and patient outcomes in the given dataset, whether it is the dataset used for building the model or data from a distinct population [26]. The models were trained using Python (Python Software Foundation, version 3.7.4). The hyperparameters of the models were tuned using grid searching (for logistic regression) with cross-validation of the training set. To prevent overfitting and reduce model complexity, we first implemented the Least Absolute Shrinkage and Selection Operator (LASSO) to filter the features. The filtered features were then applied to the model for training. Missing values for continuous variables such as APACHE II were imputed using mean value.

Comparison models. To evaluate the additive value of dynamic changes in laboratory data and chest radiograph characteristics for classification ability, we generated the following nested logistic regression models that added different input variables to the training set: Model 1 (baseline model), clinical baseline information + laboratory data, and radiographic features at T3; Model 2, Model 1 + the change in value between T3 and T1; Model 3, Model 1 + the change in value between T3 and T2; Model 4 (multiple time-series model), Model 2 + Model 3.

Calibration and decision curve analysis. we assessed prediction performance by computing the net benefit through decision curve analysis (DCA) [27]. DCA integrates important insights into the advantages of accurately prioritizing patients (true positives) and the potential risks of excessive prioritization (false positives), ultimately, presenting a net benefit across a range of threshold probabilities for the outcome (or clinical preference). Furthermore, to enhance usability, we performed score recalibration using a sigmoid function on cross-validation samples and evaluated the probability of pulmonary A. baumannii classification.

Feature importance. To improve the interpretability of our model, we adopted Shapley Additive Explanation (SHAP) [28] values to determine which features contributed the most to model predictions in the logistic regression predictions. The SHAP value measures the contribution of each feature to the assigned infection risk level, either positively or negatively, as determined by the model. We employed an open-source implementation of the SHAP value method for both calculation and visualization purposes.

Statistical analysis

Continuous variables were compared using Student’s t-test, while categorical values were compared using the χ2 test to identify any significant differences between the infection and colonization groups. Mean and standard deviation (SD) or median and interquartile range (IQR) were used to present continuous variables, while categorical variables were expressed as frequency (proportion). The area under the curve (AUC) was utilized as a comprehensive measure of discrimination, and the non-parametric Delong method was used to compare AUCs. To evaluate the model, we employed the bootstrap method to sample 1000 different test sets and obtain a 95% confidence interval (95%CI) for the model evaluation metrics. Additionally, accuracy, sensitivity, and specificity of different models were calculated. All statistical analyses were performed using MedCalc® statistical software (version 20.2; 2011 MedCalc Software bvba, Mariakerke, Belgium). A p-value of < 0.05 was considered statistically significant.

Result

Patient characteristics

A total of 152 patients met the inclusion criteria and were included in the study. Among these patients, 80 were infected and 72 were colonized by the strain of interest. The 152 patients were randomly divided into a training set of 106 patients and a test set of 46 patients. Table 1 summarizes the demographic, clinical, and time-series characteristics of the infection and colonization groups. The average age of the infection and colonization groups was 62(IQR, 53–74 years) and 59 years (IQR, 51–70 years), respectively. Notably, hypoproteinemia, cerebrovascular disease, combined fungal, APACHE II scores, and types of antibiotics used differed significantly between the colonization and infection groups (p < 0.05). Particularly, the APACHE II scores in the infection group were higher than those in the colonization group (median: 14[IQR, 10–19] vs. median: 12[IQR, 8–16], p = 0.006). Moreover, more types of antibiotics were used in the infection group prior to the strain being cultured than in the colonization group (median: 4[IQR, 2–5] vs. median: 3[IQR, 2–4], p = 0.031).

Table 1 Comparison of Demographic, Clinical, Time-series Laboratory data and Imaging Characteristics between Groups

Regarding the time-series data, Δserum creatinine 3−2 and Δpneumonia scores3 − 2 exhibited significant differences between groups (p = 0.044). Patients with infection in the pulmonary had Δpneumonia scores3 − 2 statistically higher than the colonization group (p = 0.001). More notably, the pneumonia score of the infected patients increased, while that of the colonized group decreased from T2 to T3. The remaining time-series characteristics were balanced between the two groups.

Model performance assessments

We calculated the AUC, accuracy, sensitivity, and specificity of the various models for the classification of infection and colonization (Table 2; Fig. 2). The best-performing model was Model 4, with multiple time-series features and an AUC of 0.850(95%CI: 0.638–0.873). In the test set, the baseline model (Model 1, clinical baseline information + laboratory data and radiographic features of T3) had an AUC of 0.741(95%CI: 0.568–0.775), which was significantly improved by adding the change in value between T3 and T2 (Model 3, AUC = 0.845[95%CI: 0.680–0.875], p = 0.021). Adding multiple time-series features further improved the discriminatory power (Model 4, AUC = 0.850 vs. 0.741, p = 0.041). However, adding the change in value between T3 and T1 did not improve classification performance (AUC = 0.720[95%CI: 0.551–0.777] vs. 0.741[95%CI: 0.568–0.775], p = 0.743).

Table 2 The performance of differences models in the classification of infection and colonization
Fig. 2
figure 2

Receiver operating characteristic curves of the four models to classify pulmonary A. baumannii colonization and infection. Model 1, clinical baseline information + laboratory indicators and radiographic features of T3. Model 2, model1 + the change value of between T3 and T1. Model 3, model 1 + the change value of between T3 and T2. Model 4, model 2 + model 3. All statistical comparisons between area under the receiver operating characteristic curve values of models 1–4 were significant (p < 0.05)

The multiple time-series model, which integrates temporal information from longitudinal chest radiographs, showed better performance in classification than single time-point information. In terms of the accuracy, sensitivity, and specificity of the model, when multiple time-series features were added, the accuracy and sensitivity were 17.4% (0.761, 95%CI: 0.621–0.874) and 37.5% (0.833, 95%CI: 0.626–0.953), respectively. Moreover, by additionally incorporating the change in value between T3 and T2, the accuracy improved by 19.6% (0.783; 95%CI: 0.636–0.891), and the sensitivity increased by 37.5% (0.833; 95%CI: 0.626–0.953). Additionally, the AUC of Model 3 was similar to that of model 4 (0.845 vs. 0.850, p = 0.912), whereas the accuracy and specificity of model 3 were higher than those of model 4 (0.783 vs. 0.761 and 0.727 vs. 0.682, respectively).

Clinical use of time-series model to classify Infection and colonization

In terms of clinical utility value, decision curve analysis (DCA) showed that compared with the single time point clinical model, when the threshold probability ranged from approximately 0.10–0.75, the majority of patients benefited from Model 3 and Model 4, suggesting that the addition of the change in value between T3 and T2 or multiple time-series features provides a reliable clinical tool for predicting the status of pulmonary A. baumannii. The DCA based on these four models is shown in Fig. 3. Finally, we evaluated the calibration of the various models for classification. The predicted probabilities of model 2 and model 4 were close to the observed probabilities and showed good calibration (Figure S1).

Fig. 3
figure 3

Decision curve analysis was performed to evaluate the net benefit, with the y-axis representing the net benefit. The optimal prediction to maximize net benefit was determined by identifying the higher curve at any given threshold probability. The results of the decision curve analysis demonstrated that the model utilizing all feature sets provided a greater net benefit compared to other models, highlighting its superior predictive performance

Feature importance

Figure 4 displays the characteristics sorted by the SHAP values for the best-performing model (model 4). Using SHAP analysis, we identified the 15 most informative features in our model. These features include the length of hospital stay, types of antibiotics used, surgical procedures after admission to hospital, temperature3, Δpneumonia score3 − 2, tracheotomy or not, ΔD-dimer3 − 1, APACHE II, Δalbumin3 − 1, combined culturing with other bacteria, CRP, length of hospital stay, Δ thrombocyte3 − 2, transfusion or not, serum creatinine3, Δthrombocyte3 − 1, ΔPCT3 − 1, ΔCRP3 − 2, age and hypoproteinemia. Intuitively, a longer length of hospitalization stay, length of ICU stay, higher Δ pneumonia score3 − 2, APACHE II, temperature, CRP, and the use of more types of antibiotics all contributed to a greater risk of infection, as predicted by the model. In particular, we found that radiographic changes contributed more to the model than changes in laboratory indicators.

Fig. 4
figure 4

The SHAP summary plot illustrates 20 feature clusters, where the values of a specific feature (such as average, minimum, and maximum) are aggregated. The features are arranged in descending order on the y-axis, based on their mean absolute impact on the prediction. The SHAP value for each feature is represented by the distance of the dot from the x-axis at x = 0. A farther distance indicates a greater effect (positive or negative) that the feature had on the machine learning model’s output. The color of the dot corresponds to the original feature values, ranging from low (blue) to high (magenta), as indicated by the color bar

In Fig. 5, the prediction outcomes of various models are depicted for two representative cases. The first is the case of infection that was misclassified as colonization by the baseline model but correctly classified by other time-series models. The second shows another example of colonization that was misclassified as infection by model 3 but correctly classified by model 4 (the multiple time-series model).

Fig. 5
figure 5

Two examples in the test set. On the left are the SHAP’s feature distribution analysis graphs of the highest predicted probability model prediction. The red bar represent the positive contribution to model’s prediction, while the blue bar means the negative contribution to its prediction. On the right are the radiographs of the cases. (a) An example of a 71-year-old female patient with intrahepatic bile duct stones with cholangitis. The strain was detected in the patient’s natural cough sputum 7 days after admission. The predicted probability of model 4 is 0.897. (b) A case with a 58-year-old female patient with glioblastoma of the frontal lobe. The strain was detected in the patient’s natural cough sputum 13 days after admission

Discussion

This study highlights the effectiveness of using machine learning algorithms to accurately classify pulmonary colonization and infection of A. baumannii by integrating clinical time-series imaging and laboratory characteristics. We incorporated nested models to investigate the predictive effectiveness of the models by integrating data on laboratory indicators and pneumonia scores at different time points. Among the different models tested, the multiple time-series model (model 4) showed the most promising results in terms of AUC, sensitivity, decision curve analysis, and calibration curve. Therefore, the multiple time-series model has the potential to be a valuable tool for identifying the pulmonary status of A. baumannii, aiding clinicians in making early adjustments to treatment regimens.

A. baumannii colonizes the respiratory tract by forming biofilms, leading to drug resistance and recurrent infections. This results in resistance to most antibacterial drugs and outbreaks [29]. Although colonization itself may not immediately cause infection, it can weaken the immune system and ultimately leading to infection. Our hypothesis is that changes occur in the patient’s microenvironment during this process, as indicated by fluctuations in inflammatory cell counts, imaging results, and other related indicators. One of the unique aspects of our model is that its inclusion of data not only at the time of the initial strain culture, but also at admission and 3 days before the first culture, while considering the changes in each index value. To capture the dynamic changes induced by A. baumannii, we developed a nested machine learning model based on laboratory data and chest radiographs at three different time points. As expected, our study demonstrates the additional value of dynamic change data for distinguishing between colonization and infection. The model’s expressive capability was enhanced with the addition of data from all three time points, resulting in improved performance. In comparison to models based on a single time point or two time points, the multiple time-series model achieved superior performance (AUC: 0.850 vs. 0.741 vs. 845 vs. 720), and exhibits high accuracy and sensitivity (0.761[95%CI: 0.612–0.874], 0.833[95%CI: 0.626–0.953], respectively). The top five dynamic variables identified by the model, which made significant contributions, namely Δpneumonia score3 − 2, Δalbumin3 − 1, ΔD-dimer3 − 1, Δthrombocyte3 − 1, and Δthrombocyte3 − 2.

In comparison to other lung infections, A. baumannii-induced lung changes exhibit atypical characteristics. To gain deeper insights into the potential patterns of lung alterations caused by A. baumannii and to further explore its diagnostic value in distinguishing between infection and colonization, we incorporated chest X-ray images from multiple time points. Through the analysis of these sequential X-ray images, our goal is to uncover the evolution and distinctive features of lung lesions resulting from A. baumannii infection. This comprehensive understanding will contribute to improved knowledge of the developmental dynamics associated with this infection, enabling clinicians to establish a more accurate foundation for diagnosis and treatment. Pneumonia was defined as the presence of new or progressive pulmonary infiltrates on chest radiographs [18]. For hospitalized patients, particularly those in the ICU, routine chest radiography is common. Our findings demonstrated that one imaging metric emerged as one of the top five contributors to the overall prediction accuracy of the model. Specifically, the most significant imaging factor for classification was the difference in chest radiograph pneumonia scores between the day of the initial culture of the strain and 3 days before culture. We observed that pneumonia in infected patients progressed within 1 day of the strain culture, in contrast to 3 days before culture, where the colonization group showed a reduction in pneumonia, with a statistically significant difference. This result implies that the increase in the chest radiograph pneumonia score from 3 days prior to the culture of the strain to the culture of the strain could serve as a more indicative sign of infection. This information is currently concealed within the trends observed for chest radiograph pneumonia. By promptly detecting early warning signs, we can effectively differentiate between colonization and infection and administer early treatment to infected patients. This approach holds the potential to provide timely assistance and improve patient outcomes.

SHAP values were applied to determine which features contributed the most to model predictions in the logistic regression predictions and calculate the degree of contribution of each significant feature to the model. The higher accuracy of multiple time-series models can be attributed to the inclusion of additional distinct variables, especially laboratory values such as the difference in albumin levels between T3 and T1, D-dimer, thrombocytes, CRP, and WBC count. These findings highlight the untapped potential of using multiple time-point laboratory data as potential biomarkers for precise classification of infection and colonization.

Regarding clinical applicability, the multiple time-series model (model 4) has the potential to guide individual therapies against A. baumannii from cultured sputum or alveolar lavage fluid. For patients with infections, the therapy should be targeted based on drug sensitivity results, to reduce the multiplication of bacteria in the body. In this study, 21 of 24 (87%) infected patients who could benefit from adjusting treatment regimens were successfully identified using multiple time-series model.

Our multiple time-series model holds great potential in assisting clinicians, particularly young doctors, surgeons, and intensivists, in identifying the status of A. baumannii in the lungs. Additionally, the model will play a significant role in preventing and controlling nosocomial infections, making it an important criterion for clinical research enrollment. However, this study has several limitations. Firstly, the quantitative analysis of pneumonia scores was subjective and not computer-assisted, which may introduce potential bias. Additionally, the current study is limited to a retrospective analysis conducted at a single center with a relatively small sample size. Therefore, conducting a larger-scale study is necessary to validate the clinical utility and generalizability of the model before its widespread application in clinical practice. Future research should consider expanding the study to multiple centers and increasing the sample size to ensure greater accuracy and reliability of the model’s results. Collaborating with other medical institutions in conducting prospective studies across different clinical settings would provide a comprehensive evaluation of the model’s applicability. Through rigorous validation and clinical application, we can gain a better understanding of the model’s clinical efficacy. By means of additional validation, we can enhance our comprehension of the clinical usefulness of the model, and furnish physicians with more practical and convenient decision support for identifying A. baumannii status and implementing measures for nosocomial infection prevention and control.

Conclusion

In conclusion, the proposed model, incorporating time-series chest radiographs and laboratory data, shows promise in the early detection of A. baumannii infection and colonization. We envision a potential implementation strategy in clinical practice. For high-risk patients in respiratory and critical care, neurosurgery, and critical care medicine departments, regular chest radiographs and monitoring of laboratory indicators should be conducted, with close attention to any dynamic changes in the results. In the presence of detected A. baumannii strains, the model can provide risk estimation and recommend personalized treatment strategies. However, further large-scale prospective studies are necessary to validate the effectiveness of this approach.

Data Availability

The datasets used and/or analysed during the current study are available from

the corresponding author on reasonable request.

References

  1. Giammanco A, Calà C, Fasciana T, Dowzicky MJ. Global Assessment of the activity of Tigecycline against Multidrug-Resistant Gram-negative pathogens between 2004 and 2014 as part of the Tigecycline evaluation and Surveillance Trial. mSphere. 2017;2:e00310–16.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Garnacho-Montero J, Amaya-Villar R. Multiresistant Acinetobacter baumannii Infections: epidemiology and management. Curr Opin Infect Dis. 2010;23:332–9.

    Article  PubMed  Google Scholar 

  3. Rolain J-M, Diene SM, Kempf M, Gimenez G, Robert C, Raoult D. Real-time sequencing to decipher the molecular mechanism of resistance of a clinical pan-drug-resistant Acinetobacter baumannii isolate from Marseille, France. Antimicrob Agents Chemother. 2013;57:592–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Weiner LM, Webb AK, Limbago B, Dudeck MA, Patel J, Kallen AJ, et al. Antimicrobial-resistant pathogens Associated with Healthcare-Associated Infections: Summary of Data reported to the National Healthcare Safety Network at the Centers for Disease Control and Prevention, 2011–2014. Infect Control Hosp Epidemiol. 2016;37:1288–301.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Horan TC, Andrus M, Dudeck MA. CDC/NHSN surveillance definition of health care-associated Infection and criteria for specific types of Infections in the acute care setting. Am J Infect Control. 2008;36:309–32.

    Article  PubMed  Google Scholar 

  6. Koulenti D, Tsigou E, Rello J. Nosocomial Pneumonia in 27 ICUs in Europe: perspectives from the EU-VAP/CAP study. Eur J Clin Microbiol Infect Dis. 2017;36:1999–2006.

    Article  CAS  PubMed  Google Scholar 

  7. Whiteway C, Breine A, Philippe C, Van der Henst C. Acinetobacter baumannii. Trends Microbiol. 2022;30:199–200.

    Article  CAS  PubMed  Google Scholar 

  8. Zhang T, Xu X, Xu C-F, Bilya SR, Xu W. Mechanical ventilation-associated Pneumonia caused by Acinetobacter baumannii in Northeast China region: analysis of genotype and drug resistance of bacteria and patients’ clinical features over 7 years. Antimicrob Resist Infect Control. 2021;10:135.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Nseir S, Grailles G, Soury-Lavergne A, Minacori F, Alves I, Durocher A. Accuracy of American Thoracic Society/Infectious Diseases Society of America criteria in predicting Infection or colonization with multidrug-resistant bacteria at intensive-care unit admission. Clin Microbiol Infect. 2010;16:902–8.

    Article  CAS  PubMed  Google Scholar 

  10. Al-Hashimy ZS, Conway BR, Al-Yaqoobi M, Khamis F, Al Mawali GZ, Al Maashani AM, et al. Identifying targets for antibiotic use for the management of Carbapenem-Resistant Acinetobacter baumannii (CRAb) in Hospitals-A Multi-centre Nonlinear Time-Series Study. Antibiot (Basel). 2022;11:775.

    Article  CAS  Google Scholar 

  11. Bartal C, Rolston KVI, Nesher L. Carbapenem-resistant Acinetobacter baumannii: colonization, Infection and current treatment options. Infect Dis Ther. 2022;11:683–94.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Tacconelli E, Cataldo MA, De Pascale G, Manno D, Spanu T, Cambieri A, et al. Prediction models to identify hospitalized patients at risk of being colonized or infected with multidrug-resistant Acinetobacter baumannii calcoaceticus complex. J Antimicrob Chemother. 2008;62:1130–7.

    Article  CAS  PubMed  Google Scholar 

  13. Martín-Aspas A, Guerrero-Sánchez FM, García-Colchero F, Rodríguez-Roca S, Girón-González J-A. Differential characteristics of Acinetobacter baumannii colonization and Infection: risk factors, clinical picture, and mortality. Infect Drug Resist. 2018;11:861–72.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Chaari A, Mnif B, Bahloul M, Mahjoubi F, Chtara K, Turki O, et al. Acinetobacter baumannii ventilator-associated Pneumonia: epidemiology, clinical characteristics, and prognosis factors. Int J Infect Dis. 2013;17:e1225–1228.

    Article  PubMed  Google Scholar 

  15. Munier A-L, Biard L, Legrand M, Rousseau C, Lafaurie M, Donay J-L, et al. Incidence, risk factors and outcome of multi-drug resistant Acinetobacter baumannii nosocomial Infections during an outbreak in a burn unit. Int J Infect Dis. 2019;79:179–84.

    Article  PubMed  Google Scholar 

  16. Boral B, Unaldi Ö, Ergin A, Durmaz R, Eser ÖK, Acinetobacter Study Group. A prospective multicenter study on the evaluation of antimicrobial resistance and molecular epidemiology of multidrug-resistant Acinetobacter baumannii Infections in intensive care units with clinical and environmental features. Ann Clin Microbiol Antimicrob. 2019;18:19.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Liu Z, Xu W. Neutrophil and macrophage response in Acinetobacter Baumannii Infection and their relationship to Lung Injury. Front Cell Infect Microbiol. 2022;12:890511.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Kalil AC, Metersky ML, Klompas M, Muscedere J, Sweeney DA, Palmer LB, et al. Management of adults with hospital-acquired and ventilator-associated Pneumonia: 2016 clinical practice guidelines by the Infectious Diseases Society of America and the American Thoracic Society. Clin Infect Dis. 2016;63:e61–111.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Wang L, Yin Z, Puppala M, Ezeana C, Wong K, He T, et al. A time-series feature-based recursive classification model to optimize treatment strategies for improving outcomes and resource allocations of COVID-19 patients. IEEE J Biomed Health Inform. 2022;26:3323–9.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Gould MK, Huang BZ, Tammemagi MC, Kinar Y, Shiff R. Machine learning for early Lung Cancer Identification using Routine Clinical and Laboratory Data. Am J Respir Crit Care Med. 2021;204:445–53.

    Article  PubMed  Google Scholar 

  21. Liu Y, Wang Y, Wang Y, Xie Y, Cui Y, Feng S, et al. Early prediction of treatment response to neoadjuvant chemotherapy based on longitudinal ultrasound images of HER2-positive Breast cancer patients by siamese multi-task network: a multicentre, retrospective cohort study. EClinicalMedicine. 2022;52:101562.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162:W1–73.

    Article  PubMed  Google Scholar 

  23. Chen B, He L, Hu B. Consensus of the Chinese specialists for diagnosis, treatment & control of acinetobacter baumannii Infection. Zhonghua Yi Xue Za Zhi. 2012;92:76–85.

    PubMed  Google Scholar 

  24. Hansell DM, Bankier AA, MacMahon H, McLoud TC, Müller NL, Remy J. Fleischner Society: Glossary of terms for thoracic imaging. Radiology. 2008;246:697–722.

    Article  PubMed  Google Scholar 

  25. Warren MA, Zhao Z, Koyama T, Bastarache JA, Shaver CM, Semler MW, et al. Severity scoring of lung oedema on the chest radiograph is associated with clinical outcomes in ARDS. Thorax. 2018;73:840–6.

    Article  PubMed  Google Scholar 

  26. Meurer WJ, Tolles J. Logistic Regression Diagnostics: understanding how well a Model predicts outcomes. JAMA. 2017;317:1068–9.

    Article  PubMed  Google Scholar 

  27. Van Calster B, Wynants L, Verbeek JFM, Verbakel JY, Christodoulou E, Vickers AJ, et al. Reporting and interpreting decision curve analysis: a guide for investigators. Eur Urol. 2018;74:796–804.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during Surgery. Nat Biomed Eng. 2018;2:749–60.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Harding CM, Hennon SW, Feldman MF. Uncovering the mechanisms of Acinetobacter baumannii virulence. Nat Rev Microbiol. 2018;16:91–102.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We thank all study subjects and collaborating clinicians for participating and contributing to the study.

Funding

This work was supported by National Natural Science Foundation of China [grant number 821719]; Wu Jieping Medical Foundation [grant number 320.6750.2021-06-36]; Basic and Applied Basic Research Foundation of Guangdong Province [grant number 2022A1515220196]; and Science and Technology Projects in Guangzhou [grant number 2023A04J2400].

Author information

Authors and Affiliations

Authors

Contributions

Study conception, design: Z. Zeng, J. Wu, H. Zhou, G. Qin, L. Liu, D. Yu, C. Qi, W. Chen.Data analysis and validation: Z. Zeng, G. Qin, W. Zeng, J. Lin. Manuscript drafting or manuscript revision for important intellectual content: Z. Zeng, L. Liu, D. Yu, Z. He, W. Zeng, C. Qi, W. Chen. All authors have read, edited, and approved the final manuscript.

Corresponding authors

Correspondence to Laiyu Liu, Chunxia Qi or Weiguo Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Disclosure

No potential conflict of interest was reported by the author(s).

Ethical approval

The institutional review board of Nanfang Hospital, Southern Medical University approval was obtained. The requirement for informed consent were waived by the committee, mainly due to the retrospective nature of this study.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zeng, Z., Wu, J., Qin, G. et al. Using time-series chest radiographs and laboratory data by machine learning for identifying pulmonary infection and colonization of Acinetobacter baumannii. Respir Res 25, 2 (2024). https://0-doi-org.brum.beds.ac.uk/10.1186/s12931-023-02624-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12931-023-02624-x

Keywords