Drug_induced_Autoimmunity_Prediction

Donated on 1/5/2025

This dataset comprises molecular descriptors generated using RDKit, specifically curated for the study of drug-induced autoimmunity through ensemble machine learning approaches. It is divided into a training set and a testing set, containing numerical features that represent molecular properties and structural characteristics of drugs. The dataset supports predictive modeling tasks aimed at identifying potential autoimmune risks associated with drug candidates. These molecular descriptors include physicochemical properties, providing a comprehensive foundation for machine learning analysis. The dataset facilitates the development of interpretable models for drug toxicity prediction, contributing to advancements in computational toxicology and drug safety assessment.

Dataset Characteristics

Tabular

Subject Area

Health and Medicine

Associated Tasks

Classification

Feature Type

Categorical

# Instances

477

# Features

195

Dataset Information

Has Missing Values?

No

Variables Table

Variable NameRoleTypeDescriptionUnitsMissing Values
LabelTargetBinaryyes
SMILESIDCategoricalyes
BalabanJFeatureContinuousno
BertzCTFeatureContinuousno
Chi0FeatureContinuousno
Chi0nFeatureContinuousno
Chi0vFeatureContinuousno
Chi1FeatureContinuousno
Chi1nFeatureContinuousno
Chi1vFeatureContinuousno

0 to 10 of 197

Additional Variable Information

Class Labels

1: DIA-positive drugs 0: DIA-negative drugs

Dataset Files

FileSize
DIA_trainingset_RDKit_descriptors.csv350.9 KB
DIA_testset_RDKit_descriptors.csv90.6 KB
RDKit_ChemDes.xlsx20 KB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (158.1 KB)
1 citations
3682 views

Creators

Xiaojie Huang

huangxj46@mail3.sysu.edu.cn

Jieyang People's Hospital

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy