Department of Agriculture
Enterprise Data Inventory - Volume and composition over time
M-13-13 Milestone 10 - February 29th 2016
OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.
Leading Indicators
These indicators are reviewed by the Office of Management and Budget
Review Status | complete |
---|---|
Reviewer | Justin Grimes |
Last Updated | April 18, 2016, 1:26 pm EDT by Justin Grimes |
Assessment Summary
Agency had insufficient link quality this quarter (20% or more broken and error links); Agency fails to document all outstanding Licensing information in EDI; Agency Enterprise Data Inventory is not the same as Public Data Listing (data.json).
Inventory Composition
Public Dataset Status
Dataset Link Quality
Status | Indicator | Automated Metrics | ||
---|---|---|---|---|
Overall Progress this Milestone | ||||
Inventory Updated this Quarter | ||||
784 | Number of Datasets | |||
89 | Number of APIs | |||
17 | Bureaus represented | |||
60.7% | Percentage of bureaus represented | |||
30 | Programs represented | |||
50.0% | Percentage of programs represented | |||
616 | Number of public datasets | |||
3 | Number of restricted public datasets | |||
165 | Number of non-public datasets | |||
2.2% | Percentage growth in records since last quarter | |||
To some extent (25-50%) | To what extent is your agency’s Enterprise Data Inventory (EDI) complete? | |||
See below | What steps have you taken to ensure your Enterprise Data Inventory is complete | |||
We continue to work through the USDA Open Data Council and Data Stewards Working group to identify and support new categories of datasets to add to USDA Enterprise Data Inventory. | ||||
Agency provides a public Enterprise Data Inventory on Data.gov | ||||
Agency provided updated Enterprise Data Inventory to OMB | ||||
99.5% | License specified | Crawl details | ||
Number of datasets with redactions | ||||
0% | Percent of datasets with redactions |
Status | Indicator | Automated Metrics |
---|---|---|
Overall Progress this Milestone | ||
625 | Number of Datasets | Crawl details |
1 | Number of Collections | Crawl details |
603 | Number of datasets not contained in a collection | Crawl details |
624 | Number of Public Datasets with File Downloads | Crawl details |
86 | Number of APIs | Crawl details |
Number of public APIs | Crawl details | |
Number of restricted public APIs | Crawl details | |
Number of non-public APIs | Crawl details | |
1161 | Total number of access and download links | Crawl details |
Quality Check: Links are sufficiently working | Crawl details | |
767 | Quality Check: Accessible links | Crawl details |
152 | Quality Check: Redirected links | Crawl details |
4 | Quality Check: Error links | Crawl details |
235 | Quality Check: Broken links | Crawl details |
87.7% | Quality Check: Percentage of download links in correct format as specified in metadata | Crawl details |
32.3% | Quality Check: Percentage of download links in HTML | Crawl details |
11.9% | Quality Check: Percentage of download links in PDF | Crawl details |
Percentage growth in records since last quarter | ||
100% | Valid Metadata | Crawl details |
/data exists | Crawl details | |
Provides datasets in human-readable form on /data | ||
/data.json | Crawl details | |
Harvested by data.gov | ||
622 | Number of public datasets | Crawl details |
3 | Number of restricted public datasets | Crawl details |
Number of non-public datasets | Crawl details | |
1.6% | Percent growth of public datasets | |
Percent growth of restricted public datasets | ||
Percent growth of non-public datasets | ||
Percent datasets licensed as U.S. Public Domain | ||
Percent datasets licensed as Creative Commons Zero | ||
Percent datasets with other licenses | ||
Percent datasets with no license |
Status | Indicator | Automated Metrics | ||
---|---|---|---|---|
Overall Progress this Milestone | ||||
Description of feedback mechanism delivered | Crawl details | |||
Data release is prioritized through public engagement | ||||
Provided narrative evidence of data improvements based on public feedback this quarter | ||||
Feedback loop is closed, 2 way communication | ||||
See below | Link to or description of Feedback Mechanism | |||
https://github.com/USDA/USDA-APIs/issues | ||||
Provides valid contact point information for all datasets |
Status | Indicator | Automated Metrics | ||
---|---|---|---|---|
Overall Progress this Milestone | ||||
Data Publication Process Delivered | Crawl details | |||
Information that should not to be made public is documented with agency's OGC | ||||
See below | Describe the agency's data publication process | |||
The Department of Agriculture (USDA) implemented a four step data publication process.* This multi-step dataset review process involves multiple internal stakeholders such as Data Stewards, Chief Information Officers, Privacy Officers, agency legal staff, and Information Security System Program Managers, Records Managers, and Controlled Unclassified Information managers. The purpose of this multi-perspective approach is to ensure that datasets are adequately reviewed and approved prior to release. |
Best Practice: Department of Agriculture has been highlighted for demonstrating a best practice on the Human Capital indicator
Automated Metrics
These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot
data.json
Expected Data.json URL | http://www.usda.gov/data.json (From USA.gov Directory) |
---|---|
Resolved Data.json URL | http://www.usda.gov/data.json |
Number of Redirects | |
HTTP Status | 200 |
Content Type | text/plain |
Valid JSON | Valid |
Detected Data.json Schema | federal-v1.1 |
Datasets with Valid Metadata | 100%(625 of 625) |
Valid Schema | Valid |
Datasets | 625 |
Number of Collections | 1 |
Number of datasets not in a collection | 603 |
Datasets with Distribution URLs | 99.8% (624 of 625) |
Datasets with Download URLs | 93.3% (583 of 625) |
Total Distribution URLs | 1161 |
Total Download URLs | 1036 |
Total APIs | 86 |
Public Datasets | 622 |
Restricted Public Datasets | 3 |
Non-public Datasets | 0 |
Normally there would be a set of quality assurance fields here to verify that the download links included within the metadata are functioning properly, but the results of those tests are not currently available. | |
Bureaus Represented | 17 |
Programs Represented | 30 |
License Specified | 99.5% (622 of 625) |
Datasets with Redactions | 0.0% (0 of 625) |
Redactions without explanation (rights field) | 0.0% (0 of 625) |
File Size | 1.24MB |
Last modified | Monday, 01-Feb-2016 09:48:38 EST |
Last crawl | Monday, 29-Feb-2016 23:00:38 EST |
Analyze archive copies | Analyze archive from 2016-02-29 |
Nearby Daily Crawls |
Expected /data URL | http://www.usda.gov/data (From USA.gov Directory) |
---|---|
Resolved /data URL | http://www.usda.gov/wps/portal/usda/usdahome?navid=data |
Redirects | 1 redirects |
HTTP Status | 200 |
Content Type | text/html; charset=UTF-8 |
Last crawl | Monday, 29-Feb-2016 23:00:30 EST |
Expected /digitalstrategy.json URL | http://www.usda.gov/digitalstrategy.json (From USA.gov Directory) |
---|---|
Resolved /digitalstrategy.json URL | http://www.usda.gov/digitalstrategy.json |
Redirects | |
HTTP Status | 200 |
Content Type | text/plain |
Valid JSON | Valid |
Last modified | Thursday, 20-Feb-2014 14:57:02 EST |
Last crawl | Monday, 29-Feb-2016 23:00:30 EST |
Date specified: Wednesday, 11-Dec-2013 06:42:19 EST
Date of digitalstrategy.json file: Thursday, 20-Feb-2014 14:57:02 EST1.2.4 Develop Data Inventory Schedule - Summary
Summarize the Inventory Schedule
Over the past several months, the USDA has focused its Open Data efforts on establishing a framework to enhance, enrich, and open, to the extent practicable, its Enterprise Data Inventory (EDI) and to ensure the Department and its component Agencies are prepared to identify, correctly document, and submit to the Office of Management and Budget (OMB) on November 30, 2014 a comprehensive EDI. In so doing, the USDA has already achieved several internal milestones that lay the groundwork for the Department's future Open Data efforts and position the USDA to meet OMB's Open Data requirements. The following milestones are among the Department's recent Open Data achievements:
1.2.5 Develop Data Inventory Schedule - Milestones
Title | Milestone 1 - Expand and Open EDI |
---|---|
Description | |
Milestone Date | February 28, 2014 |
Description of how this milestone expands the Inventory | The USDA will initially focus on expanding and opening its Enterprise Data Inventory. To this end, the Open Data Council, in collaboration with Agency CIOs, Data Stewards, and the Open Data Working Group (ODWG), will continue to work with the USDA's Privacy Officers to identify, prioritize and submit additional public, non-public, and restricted-public datasets to the Department. In addition, the ODC and ODWG will continue the development of an Open Data process and guidance that expedite the publication of its datasets in the future. At the end of Q1, the USDA will submit an updated EDI to OMB. |
Description of how this milestone enriches the Inventory | |
Description of how this milestone opens the Inventory |
Title | Milestone 2 - Enrich EDI and Public Datasets |
---|---|
Description | |
Milestone Date | May 31, 2014 |
Description of how this milestone expands the Inventory | Over the second quarter, from February 28 to May 31, the USDA and its component agencies will continue to input into its EDI additional datasets. The Department will also continue to collaborate with its Data Stewards to update its customer feedback and outreach efforts. At the end of Q2, the USDA will submit an updated EDI to OMB. |
Description of how this milestone enriches the Inventory | |
Description of how this milestone opens the Inventory |
Title | Milestone 3 - Expand, Enrich, and Open EDI |
---|---|
Description | |
Milestone Date | August 30, 2014 |
Description of how this milestone expands the Inventory | In the third quarter of the Open Data effort, the USDA will review and analyze its current dataset inventory, publication process, and customer feedback mechanisms. The results of the analysis will inform whether the Department needs to adjust its Open Data strategy and will help the Department prioritize the submission and release of its datasets in Q4. At the end of Q3, the USDA will submit an updated EDI to OMB. |
Description of how this milestone enriches the Inventory | |
Description of how this milestone opens the Inventory |
Title | Milestone 4 - Perform Quality Assurance and Submit Complete EDI to OMB |
---|---|
Description | |
Milestone Date | November 1, 2014 |
Description of how this milestone expands the Inventory | Customer Feedback |
Description of how this milestone enriches the Inventory | |
Description of how this milestone opens the Inventory |
1.2.6 Develop Customer Feedback Process
Describe the agency's process to engage with customers
1.2.7 Develop Data Publication Process
Describe the agency's data publication process
The Department of Agriculture (USDA) implemented a four step data publication process.* This multi-step dataset review process involves multiple internal stakeholders such as Data Stewards, Chief Information Officers, Privacy Officers, agency legal staff, and Information Security System Program Managers, Records Managers, and Controlled Unclassified Information managers. The purpose of this multi-perspective approach is to ensure that datasets are adequately reviewed and approved prior to release.