Department of Agriculture

http://www.usda.gov/wps/portal/usda/usdahome

Milestone 5 - November 30th 2014

OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.

Leading Indicators

These indicators are reviewed by the Office of Management and Budget

Review Status complete
Reviewer Jamie Berryhill (
Last Updated January 12, 2015, 11:58 am EST by paulOMB

Assessment Summary

USDA continues to expand its data with 9.9% growth since last quarter and good program represenation. However, there is no indication of an improvement in schedule milestones and a description of where the agency stands. It appears USDA has much more room for growth. A Google search of site:usda.gov filetype:xls OR filetype:csv OR filetype:xlsx OR filetype:xml returned 109,000 results. While not a perfect measure, this number suggests USDA is a very data heavy agency with further room for growth. Also, nearly 40%of USDA's bureaus are not represented in the EDI.

BEST PRACTICE: USDA provides several transparent methods for 2-way customer feedback and communication, as described at http://www.usda.gov/wps/portal/usda/usdahome?navid=DIGITALSTRATEGY, including a GitHub site. The GitHub page could be enhanced slightly by making is clear that the feedback is solicited for data sets that are not APIs, as one could interpret that this repo only pertains to APIs.

Inventory Composition

Public Dataset Status

Status Indicator Automated Metrics
Overall Progress this Milestone
Inventory Updated this Quarter
645 Number of Datasets
Number of APIs
Schedule Delivered Crawl details
17/27 Bureaus represented
29/60 Programs represented
493 Number of public datasets
6 Number of restricted public datasets
146 Number of non-public datasets
Inventory > Public listing
9.9 Percentage growth in records since last quarter
Schedule Risk for Nov 30, 2014
Spot Check - datasets listed by search engine
Agency provides a public Enterprise Data Inventory on Data.gov
License specified Crawl details
Status Indicator Automated Metrics
Overall Progress this Milestone
499 Number of Datasets Crawl details
Number of Collections Crawl details
496 Number of Public Datasets with File Downloads Crawl details
Number of APIs Crawl details
Total number of access and download links Crawl details
Quality Check: Links are sufficiently working Crawl details
Quality Check: Accessible links Crawl details
Quality Check: Redirected links Crawl details
Quality Check: Error links Crawl details
Quality Check: Broken links Crawl details
7.4 Percentage growth in records since last quarter
100 Valid Metadata Crawl details
/data exists Crawl details
/data.json Crawl details
Harvested by data.gov
21700 Views on data.gov for the quarter

Best Practice: Department of Agriculture has been highlighted for demonstrating a best practice on the Public Engagement indicator

Status Indicator Automated Metrics
Overall Progress this Milestone
Description of feedback mechanism delivered Crawl details
Data release is prioritized through public engagement
Feedback loop is closed, 2 way communication
See below Link to or description of Feedback Mechanism
http://www.usda.gov/wps/portal/usda/usdahome?navid=DIGITALSTRATEGY
Status Indicator Automated Metrics
Overall Progress this Milestone
Data Publication Process Delivered Crawl details
Information that should not to be made public is documented with agency's OGC
Status Indicator Automated Metrics
Overall Progress this Milestone
See below Open Data Primary Point of Contact
Peter Rhee peter.rhee@oc.usda.gov
POCs identified for required responsibilities
Status Indicator Automated Metrics
Overall Progress this Milestone
Identified 5 data improvements for this quarter
Primary Uses
Value or impact of data
See below Primary data discovery channels
Not provided (there is text in the field, but it's copied from the value and impact question)
See below User suggestions on improving data usability
Could use more specifics.
See below User suggestions on additional data releases
Could use more specifics.
Digital Analytics Program on /data

Automated Metrics

These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot

data.json
Expected Data.json URL http://www.usda.gov/data.json (From USA.gov Directory)
Resolved Data.json URL http://www.usda.gov/data.json
Number of Redirects
HTTP Status 200
Content Type text/plain
Valid JSON Valid
Datasets with Valid Metadata 100%(499 of 499)
Valid Schema Valid
Datasets 499
Datasets with Distribution URLs 0.0% (0 of 499)
Total Distribution URLs 0
Public Datasets 493
Restricted Public Datasets 6
Non-public Datasets 0
Bureaus Represented 19
Programs Represented 29
File Size 836.89KB
Last modified Friday, 28-Nov-2014 09:54:28 EST
Last crawl Saturday, 29-Nov-2014 23:00:30 EST
Analyze archive copies Analyze archive from 2014-11-30
/data page
Expected /data URL http://www.usda.gov/data (From USA.gov Directory)
Resolved /data URL http://www.usda.gov/wps/portal/usda/usdahome?navid=data
Redirects 1 redirects
HTTP Status 200
Content Type text/html; charset=UTF-8
Last crawl Saturday, 29-Nov-2014 23:00:32 EST
/digitalstrategy.json
Expected /digitalstrategy.json URL http://www.usda.gov/digitalstrategy.json (From USA.gov Directory)
Resolved /digitalstrategy.json URL http://www.usda.gov/digitalstrategy.json
Redirects
HTTP Status 200
Content Type text/plain
Valid JSON Valid
Last modified Thursday, 20-Feb-2014 14:57:02 EST
Last crawl Saturday, 29-Nov-2014 23:00:33 EST
Digital Strategy

Date specified: Wednesday, 11-Dec-2013 06:42:19 EST

Date of digitalstrategy.json file: Thursday, 20-Feb-2014 14:57:02 EST

1.2.4 Develop Data Inventory Schedule - Summary

Summarize the Inventory Schedule


Over the past several months, the USDA has focused its Open Data efforts on establishing a framework to enhance, enrich, and open, to the extent practicable, its Enterprise Data Inventory (EDI) and to ensure the Department and its component Agencies are prepared to identify,
correctly document, and submit to the Office of Management and Budget (OMB) on November 30, 2014 a comprehensive EDI. In so doing, the USDA has already achieved several internal milestones that lay the groundwork for the Department's future Open Data efforts and position the USDA to meet OMB's Open Data requirements. The following milestones are among the Department's recent Open Data achievements:

1.2.5 Develop Data Inventory Schedule - Milestones

TitleMilestone 1 - Expand and Open EDI
Description
Milestone DateFebruary 28, 2014
Description of how this milestone expands the InventoryThe USDA will initially focus on expanding and opening its Enterprise Data Inventory. To this end, the Open Data Council, in collaboration with Agency CIOs, Data Stewards, and the Open Data Working Group (ODWG), will continue to work with the USDA's Privacy Officers to identify, prioritize and submit additional public, non-public, and restricted-public datasets to the Department. In addition, the ODC and ODWG will continue the development of an Open Data process and guidance that expedite the publication of its datasets in the future. At the end of Q1, the USDA will submit an updated EDI to OMB.
Description of how this milestone enriches the Inventory
Description of how this milestone opens the Inventory
TitleMilestone 2 - Enrich EDI and Public Datasets
Description
Milestone DateMay 31, 2014
Description of how this milestone expands the InventoryOver the second quarter, from February 28 to May 31, the USDA and its component agencies will continue to input into its EDI additional datasets. The Department will also continue to collaborate with its Data Stewards to update its customer feedback and outreach efforts. At the end of Q2, the USDA will submit an updated EDI to OMB.
Description of how this milestone enriches the Inventory
Description of how this milestone opens the Inventory
TitleMilestone 3 - Expand, Enrich, and Open EDI
Description
Milestone DateAugust 30, 2014
Description of how this milestone expands the InventoryIn the third quarter of the Open Data effort, the USDA will review and analyze its current dataset inventory, publication process, and customer feedback mechanisms. The results of the analysis will inform whether the Department needs to adjust its Open Data strategy and will help the Department prioritize the submission and release of its datasets in Q4. At the end of Q3, the USDA will submit an updated EDI to OMB.
Description of how this milestone enriches the Inventory
Description of how this milestone opens the Inventory
TitleMilestone 4 - Perform Quality Assurance and Submit Complete EDI to OMB
Description
Milestone DateNovember 1, 2014
Description of how this milestone expands the InventoryCustomer Feedback
Description of how this milestone enriches the Inventory
Description of how this milestone opens the Inventory

1.2.6 Develop Customer Feedback Process

Describe the agency's process to engage with customers



1.2.7 Develop Data Publication Process

Describe the agency's data publication process


The Department of Agriculture (USDA) implemented a four step data publication process.* This multi-step dataset review process involves multiple internal stakeholders such as Data Stewards, Chief Information Officers, Privacy Officers, agency legal staff, and Information Security System Program Managers, Records Managers, and Controlled Unclassified Information managers. The purpose of this multi-perspective approach is to ensure that datasets are adequately reviewed and approved prior to release.