Predictive modeling is a complex science. But what is more frustrating that obtaining poor or no results at all after having invested time and money in a data mining project? At DNAlytics, we clearly understand this. On our side, it is also a pity to have to announce such poor project outcome to our customers. We definitely don't like it. That is why we now propose a very fast evaluation of the potential value of your data, and this for free!
REED (Rapid and Easy Evaluation of Datasets) is a web application which aims at automatically process a dataset in order to get a quick guess of the interest the data represents in terms of predictive modeling and markers identification. The idea is not at all to perform our best work, which cannot be automated, but to give to prospects some hints about their data potential and also some specific issues that the data would contain, and that should be looked at in details. In particular, REED provides:
After the upload, please validate the data import (see left column). You will then receive an email as soon as the results are available.
Expected file format : Comma-separated values
Here is an example of the expected file format. It is a subsamble of the Iris dataset. Spaces are optional.
,"Sepal.Length","Sepal.Width","Petal.Length","Petal.Width","class" "x1", 5.1, 3.5, 1.4, 0.2,"setosa" "x2", 4.9, 3, 1.4, 0.2,"setosa" "x3", 4.7, 3.2, , 0.2,"setosa" "x4", 4.6, 3.1, 1.5, 0.2,"setosa" "x51", 7, 3.2, 4.7, 1.4,"versicolor" "x52", 6.4, 3.2, 4.5, 1.5,"versicolor" "x53", 6.9, 3.1, 4.9, 1.5,"versicolor" "x54", 5.5, 2.3, 4, 1.3,"versicolor"
Some demo datasets are available in the Help section.
We provide here three public datasets that can directly be tested in REED.
This online application offers a raw estimation of standard metrics in a context of predictive analytics, based on data uploaded by the User. Obtaining validated estimations is a much longer and tailor-made process that cannot really be automatized. DNAlytics does not support any claim about the results validity, in particular in terms of generalization capability or biomarker identification robustness. DNAlytics will not be liable for any use that would be made of the offered results. To the contrary, DNAlytics recommends not to use these results as granted before a more extensive analysis of the data set has been performed. The aim of this application is purely to get an rough estimation of the interest there might be in pursuing such detailed analysis. Even in that case, DNAlytics does not guarantee that poor results provided by this application are definitive, and does not guarantee neither that encouraging results provided by this application is a proof of the interest the data might represent. The User is responsible for the data processing and has to be entitled to perform such processing.
The User retains all its rights on the uploaded data. DNAlytics will not make use of the uploaded data outside of the scope of the automated REED analysis, unless otherwise specified by the User.
This application is provided at no cost for any User willing to get a first idea of the value of his data. In that context, one is only granted the following use:
Version: DNA-REED-4.2.1