Department of Agriculture

http://www.usda.gov/wps/portal/usda/usdahome

Milestone 7 - May 31st 2015

OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.

Leading Indicators

These indicators are reviewed by the Office of Management and Budget

Review Status complete
Reviewer Rebecca Williams
Last Updated September 28, 2015, 2:28 pm EDT by Rebecca Williams

Assessment Summary

Enterprise Data Inventory: To be featured as a best practice, growth and license improvements are needed. The EDI includes 654 data assets, but a Google search for .xls, .csv, .xml, and .json files produces 224,000 dataset results. While 99.4% of datasets include a labeled license, many are not Public Domain. Licensing information should be reviewed and updated; see: https://project-open-data.cio.gov/open-licenses/ Any dataset that is not available in the Public Domain should include an explanation as to why in the rights field.
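As a rough illustration of the license review suggested above, the sketch below tallies the `license` values in a data.json catalog and lists datasets whose license is not a public-domain dedication and whose `rights` field is empty. The names `license_report`, `normalize`, and `PUBLIC_DOMAIN` are hypothetical, and the two URLs treated as public domain are an assumption to adjust as needed; the `license`, `rights`, and `identifier` field names follow the Project Open Data metadata schema v1.1.

```python
from collections import Counter


def normalize(url):
    """Lowercase a license URL, drop a trailing slash, and fold https to http."""
    url = url.strip().lower().rstrip("/")
    if url.startswith("https://"):
        url = "http://" + url[len("https://"):]
    return url


# Assumption: these two license URLs count as public-domain dedications.
PUBLIC_DOMAIN = {
    normalize("http://www.usa.gov/publicdomain/label/1.0/"),
    normalize("https://creativecommons.org/publicdomain/zero/1.0/"),
}


def license_report(catalog):
    """Return (tally of license values, identifiers needing a rights explanation)."""
    tally = Counter()
    needs_explanation = []
    for ds in catalog.get("dataset", []):
        lic = (ds.get("license") or "").strip()
        if not lic:
            tally["(none)"] += 1
            continue
        tally[lic] += 1
        rights = (ds.get("rights") or "").strip()
        # Flag datasets that are not public domain and give no explanation.
        if normalize(lic) not in PUBLIC_DOMAIN and not rights:
            needs_explanation.append(ds.get("identifier", "(no identifier)"))
    return tally, needs_explanation
```

Calling `license_report(json.load(open("data.json")))` on a locally saved copy of the catalog would return the tally and the list of identifiers to review.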

Public Data Listing: To be featured as a best practice, growth and metadata improvements are needed. There has been only 2% growth and 4 APIs added to the Public Data Listing since last quarter. Add featured datasets: https://www.whitehouse.gov/sites/default/files/microsites/ostp/us_open_data_action_plan.pdf Ensure all APIs are included in the PDL with rich documentation: https://raw.githubusercontent.com/18F/API-All-the-X/gh-pages/_data/individual_apis.yml; https://18f.github.io/API-All-the-X/pages/apis_in_data_catalogs

Public Engagement: To be featured as a best practice going forward, integration with Data.gov, web analytics, and a prioritized schedule based on public feedback are needed. USDA provides several transparent methods for 2-way customer feedback and communication, as described at http://www.usda.gov/wps/portal/usda/usdahome?navid=DIGITALSTRATEGY, including a GitHub site. To integrate more seamlessly with Data.gov, work to incorporate their Help Desk API into your feedback mechanism. A first step would be to include this data request link on your /data page: https://www.data.gov/data-request/?agency_name=49015 When hosting events, be sure to add them to Data.gov/events. While the related Topic www.data.gov/food received 18,213 views last quarter, it is unclear how many views usda.gov/data obtained. To remedy this, work with the DAP team: http://www.digitalgov.gov/services/dap/ Lastly, develop a public schedule to improve the open maturity of your data based on public feedback. Two U.S. local examples include: https://cityofphiladelphia.github.io/slash-data/census/; http://montgomerycountymd.gov/open/Resources/Files/OpenDataImplementationPlan_FY14.pdf; See also: https://cio.gov/wp-content/uploads/filebase/cio_document_library/Open Data Prioritization Toolkit Summary.html

Privacy and Security: To be featured as a best practice, work with your OGC to ensure all "restricted public" and "non-public" datasets are included in the EDI with accessLevel explanations included in the rights field. If any metadata requires redaction, ensure that it is redacted precisely, with a presumption for openness: https://project-open-data.cio.gov/redactions/
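The accessLevel check described above could be automated along the following lines. This is a minimal sketch, assuming the EDI is available as a data.json catalog in the Project Open Data v1.1 schema; the function name `missing_access_explanations` is hypothetical and is not part of any dashboard tooling.

```python
# Access levels that call for an explanation in the rights field.
RESTRICTED_LEVELS = {"restricted public", "non-public"}


def missing_access_explanations(catalog):
    """List (identifier, accessLevel) pairs that lack a rights explanation."""
    flagged = []
    for ds in catalog.get("dataset", []):
        level = ds.get("accessLevel", "")
        rights = (ds.get("rights") or "").strip()
        if level in RESTRICTED_LEVELS and not rights:
            flagged.append((ds.get("identifier", "(no identifier)"), level))
    return flagged
```

An empty result means every restricted or non-public entry carries some text in `rights`; whether that text is an adequate explanation still needs human review.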

Human Capital: To be featured as a best practice, growth in dedicated Open Data staff time and detailed data governance are needed.

Use & Impact: To be featured as a best practice, more detailed examples are needed showing how USDA's open data is used to achieve cost savings, efficiency, fuel for business, improved civic services, informed policy, performance planning, research and scientific discoveries, transparency and accountability, and increased public participation.

Inventory Composition

Public Dataset Status

Dataset Link Quality

Status Indicator Automated Metrics
Overall Progress this Milestone
Inventory Updated this Quarter
653 Number of Datasets
68 Number of APIs
Schedule Delivered
18 Bureaus represented
28 Programs represented
503 Number of public datasets
3 Number of restricted public datasets
145 Number of non-public datasets
Inventory > Public listing
0% Percentage growth in records since last quarter
128,000 Spot Check - datasets listed by search engine
Agency provides a public Enterprise Data Inventory on Data.gov
99.4% License specified


Status Indicator Automated Metrics
Overall Progress this Milestone
516 Number of Datasets
Number of Collections
497 Number of Public Datasets with File Downloads
68 Number of APIs
654 Total number of access and download links
Quality Check: Links are sufficiently working
493 Quality Check: Accessible links
50 Quality Check: Redirected links
89 Quality Check: Error links
14 Quality Check: Broken links
2% Percentage growth in records since last quarter
100% Valid Metadata
/data exists
/data.json
Harvested by data.gov
18,213 Views on data.gov for the quarter
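The link quality rows above sort each access or download link into one of four buckets. The dashboard's exact bucket definitions are not stated here, so the sketch below assumes one plausible mapping: 2xx responses are accessible, 3xx are redirected, 4xx and 5xx are errors, and a link with no HTTP response at all (represented as `None`) is broken. The function names are hypothetical.

```python
from collections import Counter

BUCKETS = ("accessible", "redirected", "error", "broken")


def classify_status(status):
    """Map an HTTP status code, or None for no response, to a bucket.

    Assumed mapping; the dashboard may draw these lines differently.
    """
    if status is None:
        return "broken"
    if 200 <= status < 300:
        return "accessible"
    if 300 <= status < 400:
        return "redirected"
    if 400 <= status < 600:
        return "error"
    return "broken"


def summarize_links(statuses):
    """Count how many links fall into each bucket, in a fixed order."""
    counts = Counter(classify_status(s) for s in statuses)
    return {bucket: counts.get(bucket, 0) for bucket in BUCKETS}
```

The status codes themselves would come from a separate crawl step, for example HEAD requests issued without following redirects.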
Status Indicator Automated Metrics
Overall Progress this Milestone
Description of feedback mechanism delivered
Data release is prioritized through public engagement
Feedback loop is closed, 2 way communication
See below Link to or description of Feedback Mechanism
http://www.usda.gov/wps/portal/usda/usdahome?navid=DIGITALSTRATEGY
Status Indicator Automated Metrics
Overall Progress this Milestone
Data Publication Process Delivered
Information that should not be made public is documented with agency's OGC
Status Indicator Automated Metrics
Overall Progress this Milestone
See below Open Data Primary Point of Contact
bobby.jones@ocio.usda.gov
POCs identified for required responsibilities
Status Indicator Automated Metrics
Overall Progress this Milestone
Identified 5 data improvements for this quarter
See below Primary Uses
USDA data has been used to give app developers and designers direct access to the wealth of farmer’s market information housed in the online database. It has also been used by exporters/importers, Foreign Agricultural Service (FAS) staff, and other government agencies to assess how competitive a product will be in a market as a result of applied import tariffs. In addition, it is used to detect climate variability and change information for cereal crop producers.
Value or impact of data
See below Primary data discovery channels
Supplying USDA data has significantly reduced the time needed to research pertinent data for major projects and entrepreneurial solutions. It has fostered a “build once, use multiple times” atmosphere by eliminating the cost of reproducing data that already exists. It has reduced the time-to-market for many products and services. It has enabled researchers to partner with other federal agencies, industry, and academia to produce maps, tables, charts, and other useful artifacts that support predictive analysis of crop yield and soil conditions.
See below User suggestions on improving data usability
One suggestion received was for USDA to structure the data, perhaps using a database approach, so that it can be searched easily. Users have commented that Data.gov is very large and that it is hard to find desired datasets there. Another suggestion was to categorize the datasets (i.e., climate, academic, government/NGO, and then the description).
See below User suggestions on additional data releases
Key users have requested climate data and disaster relief data. USDA is also taking an aggressive approach in providing administrative data per OMB’s M-14-06. In addition, farmers, ranchers, and equipment manufacturers have requested that Common Land Unit data be made more accessible.
Digital Analytics Program on /data

Automated Metrics

These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter, at which point they become a historical snapshot.

data.json
Expected Data.json URL http://www.usda.gov/data.json (From USA.gov Directory)
Resolved Data.json URL http://www.usda.gov/data.json
Number of Redirects
HTTP Status 200
Content Type text/plain
Valid JSON Valid
Detected Data.json Schema federal-v1.1
Datasets with Valid Metadata 100% (516 of 516)
Valid Schema Valid
Datasets 516
Datasets with Distribution URLs 100% (516 of 516)
Datasets with Download URLs 96.3% (497 of 516)
Total Distribution URLs 654
Total Download URLs 572
Total APIs 68
Public Datasets 513
Restricted Public Datasets 3
Non-public Datasets 0
Bureaus Represented 17
Programs Represented 29
License Specified 99.4% (513 of 516)
Datasets with Redactions 0.0% (0 of 516)
Redactions without explanation (rights field) 0.0% (0 of 516)
File Size 889.98KB
Last modified Thursday, 28-May-2015 11:12:40 EDT
Last crawl Sunday, 31-May-2015 00:00:36 EDT
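Several of the data.json figures above are simple counts and ratios over the catalog. The sketch below shows one way such figures might be derived; it is not the dashboard's own code. It assumes the Project Open Data v1.1 field names (`dataset`, `distribution`, `downloadURL`, `accessURL`, `format`, `accessLevel`) and assumes that an API is a distribution whose `format` is "API". The names `pct` and `catalog_metrics` are hypothetical.

```python
from collections import Counter


def pct(part, whole):
    """Format a ratio in the style 'P% (part of whole)' with one decimal."""
    if whole == 0:
        return "n/a"
    return f"{100 * part / whole:.1f}% ({part} of {whole})"


def catalog_metrics(catalog):
    """Compute dataset, distribution, and access-level counts for a catalog."""
    datasets = catalog.get("dataset", [])
    with_dist = with_download = 0
    dist_urls = download_urls = apis = 0
    levels = Counter()
    for ds in datasets:
        levels[ds.get("accessLevel", "unknown")] += 1
        has_url = has_download = False
        for dist in ds.get("distribution") or []:
            if dist.get("downloadURL"):
                download_urls += 1
                dist_urls += 1
                has_url = has_download = True
            if dist.get("accessURL"):
                dist_urls += 1
                has_url = True
            # Assumption: APIs are marked by format "API".
            if (dist.get("format") or "").upper() == "API":
                apis += 1
        with_dist += has_url
        with_download += has_download
    return {
        "datasets": len(datasets),
        "with_distribution": with_dist,
        "with_download": with_download,
        "distribution_urls": dist_urls,
        "download_urls": download_urls,
        "apis": apis,
        "access_levels": dict(levels),
    }
```

With this formatting, `pct(513, 516)` and `pct(497, 516)` give the same percentages reported above for License Specified and Datasets with Download URLs.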
/data page
Expected /data URL http://www.usda.gov/data (From USA.gov Directory)
Resolved /data URL http://www.usda.gov/wps/portal/usda/usdahome?navid=data
Redirects 1 redirect
HTTP Status 200
Content Type text/html; charset=UTF-8
Last crawl Sunday, 31-May-2015 00:00:25 EDT
/digitalstrategy.json
Expected /digitalstrategy.json URL http://www.usda.gov/digitalstrategy.json (From USA.gov Directory)
Resolved /digitalstrategy.json URL http://www.usda.gov/digitalstrategy.json
Redirects
HTTP Status 200
Content Type text/plain
Valid JSON Valid
Last modified Thursday, 20-Feb-2014 14:57:02 EST
Last crawl Sunday, 31-May-2015 00:00:25 EDT
Digital Strategy

Date specified: Wednesday, 11-Dec-2013 06:42:19 EST

Date of digitalstrategy.json file: Thursday, 20-Feb-2014 14:57:02 EST

1.2.4 Develop Data Inventory Schedule - Summary

Summarize the Inventory Schedule


Over the past several months, the USDA has focused its Open Data efforts on establishing a framework to enhance, enrich, and open, to the extent practicable, its Enterprise Data Inventory (EDI) and to ensure the Department and its component Agencies are prepared to identify, correctly document, and submit to the Office of Management and Budget (OMB) on November 30, 2014 a comprehensive EDI. In so doing, the USDA has already achieved several internal milestones that lay the groundwork for the Department's future Open Data efforts and position the USDA to meet OMB's Open Data requirements. The following milestones are among the Department's recent Open Data achievements:

1.2.5 Develop Data Inventory Schedule - Milestones

Title: Milestone 1 - Expand and Open EDI
Description:
Milestone Date: February 28, 2014
Description of how this milestone expands the Inventory: The USDA will initially focus on expanding and opening its Enterprise Data Inventory. To this end, the Open Data Council, in collaboration with Agency CIOs, Data Stewards, and the Open Data Working Group (ODWG), will continue to work with the USDA's Privacy Officers to identify, prioritize and submit additional public, non-public, and restricted-public datasets to the Department. In addition, the ODC and ODWG will continue the development of an Open Data process and guidance that expedite the publication of its datasets in the future. At the end of Q1, the USDA will submit an updated EDI to OMB.
Description of how this milestone enriches the Inventory:
Description of how this milestone opens the Inventory:
Title: Milestone 2 - Enrich EDI and Public Datasets
Description:
Milestone Date: May 31, 2014
Description of how this milestone expands the Inventory: Over the second quarter, from February 28 to May 31, the USDA and its component agencies will continue to input into its EDI additional datasets. The Department will also continue to collaborate with its Data Stewards to update its customer feedback and outreach efforts. At the end of Q2, the USDA will submit an updated EDI to OMB.
Description of how this milestone enriches the Inventory:
Description of how this milestone opens the Inventory:
Title: Milestone 3 - Expand, Enrich, and Open EDI
Description:
Milestone Date: August 30, 2014
Description of how this milestone expands the Inventory: In the third quarter of the Open Data effort, the USDA will review and analyze its current dataset inventory, publication process, and customer feedback mechanisms. The results of the analysis will inform whether the Department needs to adjust its Open Data strategy and will help the Department prioritize the submission and release of its datasets in Q4. At the end of Q3, the USDA will submit an updated EDI to OMB.
Description of how this milestone enriches the Inventory:
Description of how this milestone opens the Inventory:
Title: Milestone 4 - Perform Quality Assurance and Submit Complete EDI to OMB
Description:
Milestone Date: November 1, 2014
Description of how this milestone expands the Inventory: Customer Feedback
Description of how this milestone enriches the Inventory:
Description of how this milestone opens the Inventory:

1.2.6 Develop Customer Feedback Process

Describe the agency's process to engage with customers



1.2.7 Develop Data Publication Process

Describe the agency's data publication process


The Department of Agriculture (USDA) implemented a four-step data publication process.* This multi-step dataset review process involves multiple internal stakeholders, such as Data Stewards, Chief Information Officers, Privacy Officers, agency legal staff, Information Security System Program Managers, Records Managers, and Controlled Unclassified Information managers. The purpose of this multi-perspective approach is to ensure that datasets are adequately reviewed and approved prior to release.