
Drug_induced_Autoimmunity_Prediction
Donated on 1/5/2025
This dataset comprises molecular descriptors generated with RDKit, curated for studying drug-induced autoimmunity (DIA) with ensemble machine learning approaches. It is split into a training set and a testing set; each contains numerical features describing the physicochemical properties and structural characteristics of drugs. The dataset supports predictive modeling aimed at identifying potential autoimmune risks associated with drug candidates, and it is intended to facilitate the development of interpretable models for drug toxicity prediction in computational toxicology and drug safety assessment.
Dataset Characteristics
Tabular
Subject Area
Health and Medicine
Associated Tasks
Classification
Feature Type
Categorical
# Instances
477
# Features
195
Dataset Information
Has Missing Values?
No
Introductory Paper
By Xiaojie Huang. 2025
Published in Toxicology
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
Label | Target | Binary | | | yes |
SMILES | ID | Categorical | | | yes |
BalabanJ | Feature | Continuous | | | no |
BertzCT | Feature | Continuous | | | no |
Chi0 | Feature | Continuous | | | no |
Chi0n | Feature | Continuous | | | no |
Chi0v | Feature | Continuous | | | no |
Chi1 | Feature | Continuous | | | no |
Chi1n | Feature | Continuous | | | no |
Chi1v | Feature | Continuous | | | no |
Showing 10 of 197 variables.
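The Role column above implies a simple partition of the 197 variables: one target (`Label`), one identifier (`SMILES`), and 195 descriptor features. A minimal sketch of that partition follows, assuming the CSV headers use the variable names exactly as listed in the table. The helper name `partition_columns` is hypothetical and not part of any library.

```python
def partition_columns(header, target="Label", identifier="SMILES"):
    """Split a list of column names into target, ID, and feature columns.

    Assumes the header uses the variable names from the table above;
    raises ValueError if the target or ID column is absent.
    """
    missing = [name for name in (target, identifier) if name not in header]
    if missing:
        raise ValueError(f"missing expected columns: {missing}")
    features = [name for name in header if name not in (target, identifier)]
    return {"target": target, "id": identifier, "features": features}


# Usage with a shortened header built from the first rows of the table.
roles = partition_columns(["Label", "SMILES", "BalabanJ", "BertzCT", "Chi0"])
print(roles["features"])  # ['BalabanJ', 'BertzCT', 'Chi0']
```

Keeping `SMILES` out of the feature list matters because it is a string identifier, not a numeric descriptor.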
Additional Variable Information
Class Labels
1: DIA-positive drugs
0: DIA-negative drugs
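Because the target is binary, it is usually worth checking how the two classes are distributed before fitting a classifier. The sketch below counts labels and reports each class's share. The function name `class_balance` is hypothetical, and the labels in the usage line are placeholders for illustration, not values taken from the dataset.

```python
from collections import Counter


def class_balance(labels):
    """Return {label: (count, fraction)} for a sequence of 0/1 labels."""
    counts = Counter(int(value) for value in labels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no labels given")
    return {label: (n, n / total) for label, n in sorted(counts.items())}


# Placeholder labels for illustration only; not taken from the dataset.
example = class_balance([1, 0, 0, 1, 0])
print(example)  # {0: (3, 0.6), 1: (2, 0.4)}
```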
Dataset Files
File | Size |
---|---|
DIA_trainingset_RDKit_descriptors.csv | 350.9 KB |
DIA_testset_RDKit_descriptors.csv | 90.6 KB |
RDKit_ChemDes.xlsx | 20 KB |
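The two CSV files listed above can also be read directly, without the `ucimlrepo` package. The following is a minimal sketch using only the standard library. It assumes each CSV has a header row containing `Label` and `SMILES` columns and that every other column holds a numeric descriptor; the function name `load_descriptor_csv` is hypothetical.

```python
import csv


def load_descriptor_csv(path, target="Label", identifier="SMILES"):
    """Read one descriptor CSV into (smiles, X, y, feature_names).

    Assumes a header row with `Label` and `SMILES` columns and numeric
    values in every other column.
    """
    smiles, X, y = [], [], []
    # utf-8-sig reads plain UTF-8 and also tolerates a leading byte-order mark.
    with open(path, newline="", encoding="utf-8-sig") as handle:
        reader = csv.DictReader(handle)
        feature_names = [
            name for name in reader.fieldnames if name not in (target, identifier)
        ]
        for row in reader:
            smiles.append(row[identifier])
            y.append(int(float(row[target])))
            X.append([float(row[name]) for name in feature_names])
    return smiles, X, y, feature_names
```

Under these assumptions, calling `load_descriptor_csv("DIA_trainingset_RDKit_descriptors.csv")` and then the same function on `DIA_testset_RDKit_descriptors.csv` would give matching feature lists for the training and test sets, provided both files share the same header.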
    pip install ucimlrepo

    from ucimlrepo import fetch_ucirepo

    # fetch dataset
    drug_induced_autoimmunity_prediction = fetch_ucirepo(id=1104)

    # data (as pandas dataframes)
    X = drug_induced_autoimmunity_prediction.data.features
    y = drug_induced_autoimmunity_prediction.data.targets

    # metadata
    print(drug_induced_autoimmunity_prediction.metadata)

    # variable information
    print(drug_induced_autoimmunity_prediction.variables)
Huang, X. (2025). Drug_induced_Autoimmunity_Prediction [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C5332M.
Creators
Xiaojie Huang
huangxj46@mail3.sysu.edu.cn
Jieyang People's Hospital
DOI
10.24432/C5332M
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.