*********** Forests.nns is a NeuNet Pro Sample File ***********
This file contains special properties that allow NeuNet Pro to recognize it
as authorized NeuNet sample data. Anyone using the unlicensed version
of NeuNet Pro is welcome to experiment with this sample data.
Please do not modify this file, or it will lose its status as authorized sample data.
For further information about NeuNet Pro and additional sample data,
please visit the NeuNet Pro website at http://www.cormactech.com/neunet
All of this data has been collected from publicly available sources.
CorMac Technologies Inc. does not guarantee the accuracy of the data.
This data is intended solely for experimental purposes.
This data contains 581,012 records and 56 fields.
The included NeuNet Pro SFAM project was trained on a random sample of 32,000 records and achieves an accuracy of 68% on the remaining records.
The original data has been randomly shuffled.
*************** More about the Forests Data *******************
The Forest CoverType dataset
1. Title of Database:
Forest Covertype data
2. Sources:
(a) Original owners of database:
Remote Sensing and GIS Program
Department of Forest Sciences
College of Natural Resources
Colorado State University
Fort Collins, CO 80523
(contact Jock A. Blackard, jblackard/wo_ftcol@fs.fed.us
or Dr. Denis J. Dean, denis@cnr.colostate.edu)
NOTE: Reuse of this database is unlimited with retention of
copyright notice for Jock A. Blackard and Colorado State University.
(b) Donors of database:
Jock A. Blackard (jblackard/wo_ftcol@fs.fed.us)
USDA Forest Service
3825 E. Mulberry
Fort Collins, CO 80524 USA
Dr. Denis J. Dean (denis@cnr.colostate.edu)
Associate Professor
Department of Forest Sciences
Colorado State University
Fort Collins, CO 80523 USA
Dr. Charles W. Anderson (anderson@cs.colostate.edu)
Associate Professor
Department of Computer Science
Colorado State University
Fort Collins, CO 80523 USA
(c) Date donated: August 1998
3. Past Usage:
Blackard, Jock A. 1998. "Comparison of Neural Networks and
Discriminant Analysis in Predicting Forest Cover Types."
Ph.D. dissertation. Department of Forest Sciences.
Colorado State University. Fort Collins, Colorado.
-- Classification performance
-- first 11,340 records used for training data subset
-- next 3,780 records used for validation data subset
-- last 565,892 records used for testing data subset
-- 70% classification accuracy with a backpropagation neural network
-- 58% classification accuracy with Linear Discriminant Analysis
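The three subset sizes listed above can be checked against the total record count with a short Python sketch. It only illustrates the arithmetic of a positional split. The constant and function names are made up for this example, and because this sample file has been randomly shuffled, the positions here should not be assumed to match the records used in the dissertation.

```python
# Subset sizes as stated in the Past Usage notes.
TRAIN_SIZE = 11_340
VALIDATION_SIZE = 3_780
TEST_SIZE = 565_892


def split_bounds():
    """Return half-open (start, stop) record ranges for three consecutive subsets."""
    train_stop = TRAIN_SIZE
    validation_stop = train_stop + VALIDATION_SIZE
    test_stop = validation_stop + TEST_SIZE
    return {
        "train": (0, train_stop),
        "validation": (train_stop, validation_stop),
        "test": (validation_stop, test_stop),
    }


bounds = split_bounds()
total_records = bounds["test"][1]  # end of the last subset = total record count
```

With these sizes, `total_records` equals the 581,012 observations reported for the dataset, so the three subsets cover every record exactly once.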
4. Relevant Information Paragraph:
Predicting forest cover type from cartographic variables only
(no remotely sensed data). The actual forest cover type for
a given observation (30 x 30 meter cell) was determined from
US Forest Service (USFS) Region 2 Resource Information System
(RIS) data. Independent variables were derived from data
originally obtained from US Geological Survey (USGS) and
USFS data. Data is in raw form (not scaled) and contains
binary (0 or 1) columns of data for qualitative independent
variables (wilderness areas and soil types).
5. Number of instances (observations): 581,012
6. Number of Attributes: 12 measures, but 54 columns of data
(10 quantitative variables, 4 binary
wilderness areas and 40 binary
soil type variables)
7. Attribute information:
Given is the attribute name, attribute type, the measurement unit and
a brief description. The forest cover type is the classification problem.
The order of this listing corresponds to the order of numerals along the
rows of the database.
Name                                Data Type     Measurement                  Description
Elevation                           quantitative  meters                       Elevation in meters
Aspect                              quantitative  azimuth                      Aspect in degrees azimuth
Slope                               quantitative  degrees                      Slope in degrees
Horizontal_Distance_To_Hydrology    quantitative  meters                       Horz Dist to nearest surface water features
Vertical_Distance_To_Hydrology      quantitative  meters                       Vert Dist to nearest surface water features
Horizontal_Distance_To_Roadways     quantitative  meters                       Horz Dist to nearest roadway
Hillshade_9am                       quantitative  0 to 255 index               Hillshade index at 9am, summer solstice
Hillshade_Noon                      quantitative  0 to 255 index               Hillshade index at noon, summer solstice
Hillshade_3pm                       quantitative  0 to 255 index               Hillshade index at 3pm, summer solstice
Horizontal_Distance_To_Fire_Points  quantitative  meters                       Horz Dist to nearest wildfire ignition points
Wilderness_Area (4 binary columns)  qualitative   0 (absence) or 1 (presence)  Wilderness area designation
Soil_Type (40 binary columns)       qualitative   0 (absence) or 1 (presence)  Soil Type designation
Cover_Type (7 types)                integer       1 to 7                       Forest Cover Type designation
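As a rough illustration of how the 12 measures expand into 54 data columns plus the class label, the following Python sketch builds a column-name list in the order of the listing above. The numeric suffixes on the binary columns (for example `Wilderness_Area1`, `Soil_Type40`) are an assumed naming convention for this example, not names defined by this documentation.

```python
# The ten quantitative attributes, in the order given in the attribute listing.
QUANTITATIVE = [
    "Elevation",
    "Aspect",
    "Slope",
    "Horizontal_Distance_To_Hydrology",
    "Vertical_Distance_To_Hydrology",
    "Horizontal_Distance_To_Roadways",
    "Hillshade_9am",
    "Hillshade_Noon",
    "Hillshade_3pm",
    "Horizontal_Distance_To_Fire_Points",
]

# Assumed naming: one numbered column per binary indicator.
WILDERNESS = [f"Wilderness_Area{i}" for i in range(1, 5)]   # 4 binary columns
SOIL = [f"Soil_Type{i}" for i in range(1, 41)]              # 40 binary columns

FEATURE_COLUMNS = QUANTITATIVE + WILDERNESS + SOIL          # 10 + 4 + 40 = 54
ALL_COLUMNS = FEATURE_COLUMNS + ["Cover_Type"]              # class label comes last
```

The list covers only the columns described in this listing. Any additional fields that a particular export of the data may carry are outside the scope of this sketch.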
Code Designations:
Wilderness Areas:    1 -- Rawah Wilderness Area
                     2 -- Neota Wilderness Area
                     3 -- Comanche Peak Wilderness Area
                     4 -- Cache la Poudre Wilderness Area
Soil Types:          1 to 40 : based on the USFS Ecological
                     Landtype Units for this study area.
Forest Cover Types:  1 -- Spruce/Fir
                     2 -- Lodgepole Pine
                     3 -- Ponderosa Pine
                     4 -- Cottonwood/Willow
                     5 -- Aspen
                     6 -- Douglas-fir
                     7 -- Krummholz
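The code designations above can be applied to a record with a small Python sketch like the one below. It assumes a record is a sequence of 55 integers in the documented order (10 quantitative values, 4 wilderness flags, 40 soil flags, then the cover type), and that exactly one flag is set in each binary group. Both are assumptions of this example rather than guarantees stated here, and the function names are hypothetical.

```python
WILDERNESS_AREAS = {
    1: "Rawah Wilderness Area",
    2: "Neota Wilderness Area",
    3: "Comanche Peak Wilderness Area",
    4: "Cache la Poudre Wilderness Area",
}

COVER_TYPES = {
    1: "Spruce/Fir",
    2: "Lodgepole Pine",
    3: "Ponderosa Pine",
    4: "Cottonwood/Willow",
    5: "Aspen",
    6: "Douglas-fir",
    7: "Krummholz",
}


def one_hot_to_code(flags):
    """Convert a group of 0/1 flags into a 1-based code.

    Assumes exactly one flag is set; raises ValueError otherwise.
    """
    flags = list(flags)
    if any(f not in (0, 1) for f in flags) or sum(flags) != 1:
        raise ValueError("expected exactly one flag set to 1")
    return flags.index(1) + 1


def decode_row(row):
    """Decode the qualitative fields of one 55-value record."""
    if len(row) != 55:
        raise ValueError("expected 55 values per record")
    return {
        "wilderness_area": WILDERNESS_AREAS[one_hot_to_code(row[10:14])],
        "soil_type": one_hot_to_code(row[14:54]),  # 1 to 40
        "cover_type": COVER_TYPES[row[54]],
    }
```

Raising an error on a malformed flag group, instead of silently picking the first set flag, is a deliberate choice here so that misaligned columns are noticed early.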
NOTE: Summary statistics not included in this documentation.
8. Missing Attribute Values: None.
9. Class distribution:
Number of records of Spruce/Fir: 211840
Number of records of Lodgepole Pine: 283301
Number of records of Ponderosa Pine: 35754
Number of records of Cottonwood/Willow: 2747
Number of records of Aspen: 9493
Number of records of Douglas-fir: 17367
Number of records of Krummholz: 20510
Number of records of other: 0
Total records: 581012
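The class counts above are heavily imbalanced, which matters when reading the accuracy figures quoted elsewhere in this file. As a minimal Python sketch, the share of each class and the accuracy of a trivial "always predict the most common class" baseline can be computed as follows. The variable names are chosen for this example only.

```python
# Record counts per cover type, copied from the class distribution listing.
CLASS_COUNTS = {
    "Spruce/Fir": 211840,
    "Lodgepole Pine": 283301,
    "Ponderosa Pine": 35754,
    "Cottonwood/Willow": 2747,
    "Aspen": 9493,
    "Douglas-fir": 17367,
    "Krummholz": 20510,
}

total = sum(CLASS_COUNTS.values())

# Fraction of all records belonging to each class.
shares = {name: count / total for name, count in CLASS_COUNTS.items()}

# Most frequent class and the accuracy obtained by always predicting it.
majority_class = max(CLASS_COUNTS, key=CLASS_COUNTS.get)
baseline_accuracy = shares[majority_class]
```

The counts sum to the stated total of 581,012, and the majority class (Lodgepole Pine) accounts for roughly 48.8% of the records, which gives a reference point for the reported classification accuracies.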
============================================================================================================================================
Jock A. Blackard
8/28/98