Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Try the next-generation Data Catalog at catalog-beta.data.gov and help shape it with your feedback.

Data from: Predictive Modeling of Seed Biochemicals Composition Traits using Machine learning algorithms

Metadata Updated: January 6, 2026

Sorghum is a versatile cereal crop grown the central U.S. especially in Kansas and Texas and is an important grain for feed, biofuel, and food markets. The growing demand for sorghum has increased the need for enhanced hybrids with superior grain composition, such as high protein and starch content, which are key determinants of its nutritional and economic value. The composition of sorghum grain is highly influenced by genetics, growth conditions, and crop management practices, all of which influence the final yield and quality of the crop. To develop new varieties of sorghum with improved grain traits, grain composition must be measured on large sample sets grown at multiple locations and often with different crop practices, which is currently a labor-intensive and time-consuming process. Thus, this research investigated the ability of machine learning to predict plant growth features and grain composition collected by high throughput techniques and determine relationships between crop management practices and grain composition and ultimately end-use value. By identifying optimal variety-management combinations and leveraging non-invasive, high-throughput plant and grain analysis, this study offers a scalable framework for real-time decision-making and targeted field interventions to improve sorghum varieties.

This dataset contains grain composition determined by near-infrared spectroscopy used as part of this research project.

Access & Use Information

Public: This dataset is intended for public access and use. License: Creative Commons CCZero

Downloads & Resources

Dates

Metadata Created Date January 6, 2026
Metadata Updated Date January 6, 2026

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Metadata Created Date January 6, 2026
Metadata Updated Date January 6, 2026
Publisher Agricultural Research Service
Maintainer
Identifier 10.15482/USDA.ADC/30651527.v1
Data Last Modified 2025-12-22
Public Access Level public
Bureau Code 005:18
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id d9535f1a-daf7-4406-9af8-cfbd2fb7e84b
Harvest Source Id d3fafa34-0cb9-48f1-ab1d-5b5fdc783806
Harvest Source Title USDA JSON
License https://creativecommons.org/publicdomain/zero/1.0/
Program Code 005:040
Source Datajson Identifier True
Source Hash a4555d76668950e2fdf103aecd1b67eece2895c143e3dda30976d60d81ed9cde
Source Schema Version 1.1
Temporal 2024-11-19/2025-02-15

Didn't find what you're looking for? Suggest a dataset here.