{"accessLevel": "public", "bureauCode": ["010:12"], "contactPoint": {"@type": "vcard:Contact", "fn": "Jennifer C Murphy", "hasEmail": "mailto:jmurphy@usgs.gov"}, "description": "This data release contains one dataset and one model archive in support of the journal article \"Leveraging machine learning to automate regression model evaluations for large multi-site water-quality trend studies\" by Jennifer C. Murphy and Jeffrey G. Chanat. The model archive contains scripts (run in R) to reproduce the four machine learning models (logistic regression, linear and quadratic discriminant analysis, and k-nearest neighbors) trained and tested as part of the journal article. The dataset contains the estimated probabilities for each of these models when applied to a training and test dataset.", "distribution": [{"@type": "dcat:Distribution", "accessURL": "https://doi.org/10.5066/P9GNEN8S", "description": "Landing page for access to the data", "format": "XML", "mediaType": "application/http", "title": "Digital Data"}, {"@type": "dcat:Distribution", "description": "The metadata original format", "downloadURL": "https://data.usgs.gov/datacatalog/metadata/USGS.647a3349d34eac007b521f2d.xml", "format": "XML", "mediaType": "text/xml", "title": "Original Metadata"}], "identifier": "http://datainventory.doi.gov/id/dataset/USGS_647a3349d34eac007b521f2d", "keyword": ["biota", "logistic regression", "k-nearest neighbors", "quadratic discriminant analysis", "USGS:647a3349d34eac007b521f2d", "United States of America", "linear discriminant analysis", "Delaware River Basin"], "modified": "2023-10-04T00:00:00Z", "publisher": {"@type": "org:Organization", "name": "U.S. Geological Survey"}, "spatial": "-127.7930, 24.0465, -64.6875, 49.8380", "theme": ["geospatial"], "title": "Data to support Leveraging machine learning to automate regression model evaluations for large multi-site water-quality trend studies"}