In 'archive.ics.uci.edu' number of attributes of 'Breast Cancer Wisconsin (Diagnostic) Data Set' is 32 but when downloading it, it has 11 attributes, I cant understand what happen to that 32 attribute?

7 views (last 30 days)
I was searching for breast cancer dataset https://archive.ics.uci.edu/ml/datasets.html and i have found 4 dataset with some different name. I dont know the difference between them and that how attributes come in each dataset?
[Merged from duplicate]
I need breast cancer dataset and I searched it in https://archive.ics.uci.edu/ml/datasets.html i have found four breast cancer dataset. i don't know the difference between them and second problem is that in page https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29 in table number of attributes is 32 but when downloading dataset it has only 11 attribute (in this page https://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.data)
  3 Comments
shadi rashtian
shadi rashtian on 21 Jun 2017
in <ftp://ftp.cs.wisc.edu/math-prog/cpo-dataset/machine-learn/cancer/WDBC/> dataset has 32 attribute. but whats the difference between dataset of this site and https://archive.ics.uci.edu/ml/datasets.html . i want to use it in clustering. which of them should i use?
Walter Roberson
Walter Roberson on 21 Jun 2017
I think you might be looking at the "Original", which has 11 attributes (10 mentioned in the table but 11 if you look at the details, when you include the ID code)
The one with the 32 is the "Diagnostic" set.

Sign in to comment.

Answers (1)

Walter Roberson
Walter Roberson on 21 Jun 2017
data = readtable('WDBC.dat');
gives a table with 32 columns.

Categories

Find more on Biotech and Pharmaceutical in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!