Department of Health and Human Services

http://www.hhs.gov/

Milestone 14 - February 28th 2017

OMB Review In Progress: OMB is currently reviewing the agency for this milestone. This review status indicator will change once the review is complete.

Leading Indicators

These indicators are reviewed by the Office of Management and Budget

Review Status in-progress
Reviewer Bryant Renaud
Last Updated April 20, 2017, 10:42 am EDT by Bryant Renaud

Assessment Summary

Please update your POC for Open Data.

EDI is Red: Agency fails to document non-public datasets. Some datasets are missing licensing information. Fails to document APIs.

PDL is Yellow: 163 datasets have schema validation errors: https://labs.data.gov/dashboard/validate.

Inventory Composition

Public Dataset Status

Dataset Link Quality

Status Indicator Automated Metrics
Overall Progress this Milestone
Inventory Updated this Quarter
1928 Number of Datasets
Number of APIs
8 Bureaus represented
57.1% Percentage of bureaus represented
21 Programs represented
18.1% Percentage of programs represented
1873 Number of public datasets
55 Number of restricted public datasets
Number of non-public datasets
8.7% Percentage growth in records since last quarter
To a very great extent (>75%) To what extent is your agency’s Enterprise Data Inventory (EDI) complete?
See below What steps have you taken to ensure your Enterprise Data Inventory is complete
To ensure the Enterprise Data Inventory is complete, we try to link the data assets to the information systems catalogued in the Department's enterprise architecture repository. This approach supports monitoring the process of maturing the EDI through various internal reports. Furthermore, the visibility of data assets in the repository supports current and future data sharing and interoperability. In addition, the Department manages the inventory by asking for major IT investment assets first and minor investments over time.
Agency provides a public Enterprise Data Inventory on Data.gov
Agency provided updated Enterprise Data Inventory to OMB
91.7% License specified
Number of datasets with redactions
Percent of datasets with redactions
Status Indicator Automated Metrics
Overall Progress this Milestone
1928 Number of Datasets
Number of Collections
1928 Number of datasets not contained in a collection
1428 Number of Public Datasets with File Downloads
Number of APIs
Number of public APIs
Number of restricted public APIs
Number of non-public APIs
3155 Total number of access and download links
Quality Check: Links are sufficiently working
1974 Quality Check: Accessible links
1026 Quality Check: Redirected links
34 Quality Check: Error links
99 Quality Check: Broken links
0.9% Quality Check: Percentage of download links in correct format as specified in metadata
11.6% Quality Check: Percentage of download links in HTML
0.2% Quality Check: Percentage of download links in PDF
8.7% Percentage growth in records since last quarter
91.5% Valid Metadata
/data exists
Provides datasets in human-readable form on /data
/data.json
Harvested by data.gov
1873 Number of public datasets
55 Number of restricted public datasets
Number of non-public datasets
9.0% Percent growth of public datasets
0.0% Percent growth of restricted public datasets
0.0% Percent growth of non-public datasets
Percent datasets licensed as U.S. Public Domain
Percent datasets licensed as Creative Commons Zero
Percent datasets with other licenses
Percent datasets with no license
Status Indicator Automated Metrics
Overall Progress this Milestone
Description of feedback mechanism delivered
Data release is prioritized through public engagement
Provided narrative evidence of data improvements based on public feedback this quarter
Feedback loop is closed, two-way communication
See below Link to or description of Feedback Mechanism
HealthData.gov offers a comment mechanism that accepts public feedback associated with a dataset, as well as feature requests for the platform. Responses to those comments are posted publicly. Similarly, our Demand Driven Open Data pilot has opened the pipeline for more robust data discussions, found at ddod.healthdata.gov. Lastly, the Health Datapalooza (HDP) is the premier live event for public interaction with government health data curators.
Provides valid contact point information for all datasets
Status Indicator Automated Metrics
Overall Progress this Milestone
Data Publication Process Delivered
Information that should not be made public is documented with agency's OGC
Describe the agency's data publication process
Status Indicator Automated Metrics
Overall Progress this Milestone
Damon.Davis@hhs.gov Open Data Primary Point of Contact
POCs identified for required responsibilities
Chief Data Officer (if applicable)
Status Indicator Automated Metrics
Overall Progress this Milestone
Provided narrative evidence of open data impacts for this quarter
Digital Analytics Program on /data
Views on data.gov for this quarter
Percentage growth in views on data.gov for this quarter
Views on agency /data page for this quarter

Automated Metrics

These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot

data.json
Expected Data.json URL http://www.hhs.gov/data.json (From USA.gov Directory)
Resolved Data.json URL https://www.healthdata.gov/data.json
Number of Redirects 4 redirects
HTTP Status 200
Content Type application/json
Valid JSON Valid
Detected Data.json Schema federal-v1.1
Datasets with Valid Metadata 91.5% (1765 of 1928)
Valid Schema Invalid
For more complete and readable validation results, see the full schema validator results
Schema Errors There are validation errors on 163 records

Only showing errors from the first 10 records:

Errors on record 354:
programCode
Errors on record 961:
bureauCode
  • The property bureauCode is required
Errors on record 1040:
bureauCode
  • The property bureauCode is required
Errors on record 1076:
bureauCode
  • The property bureauCode is required
Errors on record 1171:
bureauCode
  • The property bureauCode is required
Errors on record 1184:
bureauCode
  • The property bureauCode is required
Errors on record 1283:
bureauCode
  • The property bureauCode is required
Errors on record 1290:
bureauCode
  • The property bureauCode is required
Errors on record 1291:
bureauCode
  • The property bureauCode is required
Errors on record 1292:
bureauCode
  • The property bureauCode is required
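Most of the errors listed above are missing required properties. A minimal Python sketch of that kind of check follows, assuming a catalog shaped like a Project Open Data v1.1 data.json file with a top-level "dataset" array. The REQUIRED_FIELDS tuple is only a subset of what the schema requires, the sample records are hypothetical, and the zero-based record numbering is an assumption; the dashboard's own validator is not reproduced here.

```python
from typing import Any, Dict, List

# Assumption: a subset of the properties the federal v1.1 schema requires.
# The real schema also checks formats and allowed values.
REQUIRED_FIELDS = (
    "title",
    "description",
    "identifier",
    "accessLevel",
    "bureauCode",
    "programCode",
)


def missing_required(catalog: Dict[str, Any]) -> Dict[int, List[str]]:
    """Map each record index to the required properties it lacks or leaves empty."""
    problems: Dict[int, List[str]] = {}
    for index, record in enumerate(catalog.get("dataset", [])):
        missing = [name for name in REQUIRED_FIELDS if not record.get(name)]
        if missing:
            problems[index] = missing
    return problems


# Hypothetical two-record catalog, not taken from the HHS file.
sample = {
    "dataset": [
        {
            "title": "Example A",
            "description": "First illustrative record",
            "identifier": "example-a",
            "accessLevel": "public",
            "bureauCode": ["009:00"],
            "programCode": ["009:001"],
        },
        {
            "title": "Example B",
            "description": "Second illustrative record",
            "identifier": "example-b",
            "accessLevel": "public",
            "programCode": ["009:001"],
            # bureauCode is deliberately left out
        },
    ]
}

for index, names in missing_required(sample).items():
    print(f"Errors on record {index}:")
    for name in names:
        print(f"  • The property {name} is required")
```

Running a check like this before publishing data.json would surface missing bureauCode or programCode values before the next crawl.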
Datasets 1928
Number of Collections 0
Number of datasets not in a collection 1928
Datasets with Distribution URLs 74.1% (1428 of 1928)
Datasets with Download URLs 74.1% (1428 of 1928)
Total Distribution URLs 3155
Total Download URLs 3155
Total APIs 0
Public APIs 0
Restricted Public APIs 0
Non-public APIs 0
Public Datasets 1873
Restricted Public Datasets 55
Non-public Datasets 0
Bureaus Represented 8
Programs Represented 21
License Specified 91.7% (1768 of 1928)
Datasets with Redactions 0.0% (0 of 1928)
Redactions without explanation (rights field) 0.0% (0 of 1928)
File Size 3.05MB
Last modified Tuesday, 28-Feb-2017 23:01:02 EST
Last crawl Tuesday, 28-Feb-2017 23:01:04 EST
Analyze archive copies Analyze archive from 2017-02-28
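The count-based metrics above can be approximated from a local copy of data.json. The sketch below is a rough Python illustration, assuming each record carries the Project Open Data v1.1 fields accessLevel, license, and distribution (with downloadURL); the function names and the sample catalog are hypothetical, and the dashboard's exact counting rules may differ.

```python
from collections import Counter
from typing import Any, Dict


def percent(part: int, total: int) -> float:
    """Share of total as a percentage rounded to one decimal place."""
    return round(100.0 * part / total, 1) if total else 0.0


def summarize(catalog: Dict[str, Any]) -> Dict[str, Any]:
    """Count datasets by access level, license presence, and download links."""
    datasets = catalog.get("dataset", [])
    total = len(datasets)
    access = Counter(d.get("accessLevel", "unknown") for d in datasets)
    licensed = sum(1 for d in datasets if d.get("license"))
    with_download = sum(
        1
        for d in datasets
        if any(dist.get("downloadURL") for dist in d.get("distribution", []))
    )
    return {
        "datasets": total,
        "public": access["public"],
        "restricted public": access["restricted public"],
        "non-public": access["non-public"],
        "license specified (%)": percent(licensed, total),
        "with download URLs (%)": percent(with_download, total),
    }


# Hypothetical three-record catalog, not taken from the HHS file.
sample = {
    "dataset": [
        {
            "accessLevel": "public",
            "license": "https://creativecommons.org/publicdomain/zero/1.0/",
            "distribution": [{"downloadURL": "https://example.gov/a.csv"}],
        },
        {"accessLevel": "public", "distribution": []},
        {"accessLevel": "restricted public", "license": "http://example.gov/terms"},
    ]
}

print(summarize(sample))
```

Applied to the counts reported above, percent(1765, 1928) gives 91.5, percent(1768, 1928) gives 91.7, and percent(1428, 1928) gives 74.1, matching the published percentages.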