摘要
Cyclooxygenase-2 (COX-2) is an enzyme that plays a crucial role in inflammation by converting arachidonic acid into prostaglandins. The overexpression of enzyme is associated with conditions such as cancer, arthritis, and Alzheimer's disease (AD), where it contributes to neuroinflammation. In silico virtual screening is pivotal in early-stage drug discovery; however, the absence of coding or machine learning expertise can impede the development of reliable computational models capable of accurately predicting inhibitor compounds based on their chemical structure. In this study, we developed an automated KNIME workflow for predicting the COX-2 inhibitory potential of novel molecules by building a multi-level ensemble model constructed with five machine learning algorithms (i.e., Logistic Regression, K-Nearest Neighbors, Decision Tree, Random Forest, and Extreme Gradient Boosting) and various molecular and fingerprint descriptors (i.e., AtomPair, Avalon, MACCS, Morgan, RDKit, and Pattern). Post-applicability domain filtering, the final majority voting-based ensemble model achieved 90.0% balanced accuracy, 87.7% precision, and 86.4% recall on the external validation set. The freely accessible workflow empowers users to swiftly and effortlessly predict COX-2 inhibitors, eliminating the need for any prior knowledge in machine learning, coding, or statistical modeling, significantly broadening its accessibility. While beginners can seamlessly use the tool as is, experienced KNIME users can leverage it as a foundation to build advanced workflows, driving further research and innovation.