| Status |
Indicator |
Automated Metrics |
|
|
Overall Progress this Milestone
|
|
|
|
Inventory Updated this Quarter
|
|
|
3184 |
Number of Datasets
|
|
|
2344 |
Number of APIs
|
|
|
1 |
Bureaus represented
|
|
|
100.0% |
Percentage of bureaus represented
|
|
|
8 |
Programs represented
|
|
|
6.6% |
Percentage of programs represented
|
|
|
3132 |
Number of public datasets
|
|
|
11 |
Number of restricted public datasets
|
|
|
41 |
Number of non-public datasets
|
|
|
|
Percentage growth in records since last quarter
|
|
|
To a great extent (50-75%) |
To what extent is your agency’s Enterprise Data Inventory (EDI) complete?
|
|
|
See below |
What steps have you taken to ensure your Enterprise Data Inventory is complete
|
|
|
In March 2015, the EPA CIO signed the EPA Enterprise Information Management Policy (EIMP), the EIMP Cataloguing Data Resources Procedure, and the EIMP Minimum Metadata Standards. The EIMP requires all EPA Organization officials, employees, and individuals or non EPA organizations, if applicable to ensure information is: The EIMP Cataloguing Data Resources Procedure states: The Agency's internal metadata catalog, the Environmental Dataset Gateway (EDG) was established in 2006. The EDG Team has worked with data owners since its inception to catalog Agency datasets. In response to Project Open Data requirements the EPA CIO issued an Agencywide data call asking all EPA organizations to register their datasets in EDG. The EDG Team worked with the Agency's Information Management Officers (IMOs) and the EDG's Stewardship Network and other key data owners to ensure that as many Agency data sets as possible were identified for registration. In addition, EPA's registry for IT systems, READ, was reviewed to ensure that all possible data owners were contacted. Since that time, the EDG team has established an ongoing relationship with the IMOs and has increased its network of stakeholders to ensure that any datasets not identified during the 2013 data call are registered in EDG. Quarterly meetings and training sessions held with these groups to educate them on Open Data requirements and metadata best practices as well as to encourage them to continue cataloguing their datasets. Targeted outreach, based on new entries in READ are conducted to ensure that all datasets are listed in the EDI. This includes working with Offices that have Confidential Business Information to ensure that we have a full registration of all data not shared with the public. In addition, EPA's Security Office is planning an audit of all Agency data systems and is coordinating with the EDG team to ensure that any uncatalogued datasets discovered in this process are registered in EDG and become part of EPA's EDI. And finally, EPA is also developing an |
|
|
Agency provides a public Enterprise Data Inventory on Data.gov
|
|
|
|
Agency provided updated Enterprise Data Inventory to OMB
|
|
|
100% |
License specified
|
Crawl details
|
|
|
Number of datasets with redactions
|
|
|
100% |
Percent of datasets with redactions
|
|
| Status |
Indicator |
Automated Metrics |
|
|
Overall Progress this Milestone
|
|
|
3184 |
Number of Datasets
|
Crawl details
|
|
21 |
Number of Collections
|
Crawl details
|
|
1722 |
Number of datasets not contained in a collection
|
Crawl details
|
|
2531 |
Number of Public Datasets with File Downloads
|
Crawl details
|
|
2344 |
Number of APIs
|
Crawl details
|
|
|
Number of public APIs
|
Crawl details
|
|
|
Number of restricted public APIs
|
Crawl details
|
|
|
Number of non-public APIs
|
Crawl details
|
|
2691 |
Total number of access and download links
|
Crawl details
|
|
|
Quality Check: Links are sufficiently working
|
Crawl details
|
|
1078 |
Quality Check: Accessible links
|
Crawl details
|
|
625 |
Quality Check: Redirected links
|
Crawl details
|
|
6 |
Quality Check: Error links
|
Crawl details
|
|
848 |
Quality Check: Broken links
|
Crawl details
|
|
6.8% |
Quality Check: Percentage of download links in correct format as specified in metadata
|
Crawl details
|
|
60.0% |
Quality Check: Percentage of download links in HTML
|
Crawl details
|
|
0.6% |
Quality Check: Percentage of download links in PDF
|
Crawl details
|
|
|
Percentage growth in records since last quarter
|
|
|
100% |
Valid Metadata
|
Crawl details
|
|
|
/data exists
|
Crawl details
|
|
|
Provides datasets in human-readable form on /data
|
|
|
|
/data.json
|
Crawl details
|
|
|
Harvested by data.gov
|
|
|
3132 |
Number of public datasets
|
Crawl details
|
|
11 |
Number of restricted public datasets
|
Crawl details
|
|
41 |
Number of non-public datasets
|
Crawl details
|
|
|
Percent growth of public datasets
|
|
|
|
Percent growth of restricted public datasets
|
|
|
|
Percent growth of non-public datasets
|
|
|
|
Percent datasets licensed as U.S. Public Domain
|
|
|
|
Percent datasets licensed as Creative Commons Zero
|
|
|
|
Percent datasets with other licenses
|
|
|
|
Percent datasets with no license
|
|
| Status |
Indicator |
Automated Metrics |
|
|
Overall Progress this Milestone
|
|
|
|
Description of feedback mechanism delivered
|
Crawl details
|
|
|
Data release is prioritized through public engagement
|
|
|
|
Provided narrative evidence of data improvements based on public feedback this quarter
|
|
|
|
Feedback loop is closed, 2 way communication
|
|
|
See below |
Link to or description of Feedback Mechanism
|
|
|
https://developer.epa.gov/forums/forum/dataset-qa/ |
|
|
Provides valid contact point information for all datasets
|
|
| Status |
Indicator |
Automated Metrics |
|
|
Overall Progress this Milestone
|
|
|
|
Data Publication Process Delivered
|
Crawl details
|
|
|
Information that should not to be made public is documented with agency's OGC
|
|
|
See below |
Describe the agency's data publication process
|
|
|
EPA’s data registration, evaluation, and publication processes are being developed as part of the Agency’s Environmental Information Management Policy (EIMP). The EIMP is currently undergoing Agency-wide review before being finalized. The EIMP includes a data asset registration procedure that is scheduled to be adopted with the policy. Additional procedures, such as defining the details of the data publication process will follow. The registration and classification of data assets will be addressed by the following: • Issue the EIMP, Cataloging EPA Data Resources Procedure and associated Standard Operating Procedures (SOPs) which require the registration of data assets. Create comprehensive processes to fill metadata gaps for existing records and ensure compliance with the registration of non-listed assets. • Modify the EDG to include an initial “data sensitivity” evaluation during the registration of an asset noting a determination of a range of data sensitivity categories such as: - Controlled Unclassified Information (CUI) - Personally Identifiable Information (PII) - Confidential Business Information (CBI) - Information with National Security sensitivities • EPA currently conducts reviews to evaluate the appropriate release of information to the public, however, to address the anticipated increase in demand for information the Agency is developing a more formal process to document sensitivity determinations and help set expectations when determinations will be completed. EPA’s Office of General Counsel (OGC) is frequently involved in data release determinations and will be a part of the more formal process--making final determinations on data that is deemed too sensitive for disclosure. |
Best Practice: Environmental Protection Agency has been highlighted for demonstrating a best practice on the Human Capital indicator
| Status |
Indicator |
Automated Metrics |
|
|
Overall Progress this Milestone
|
|
|
greene.ana@epa.gov |
Open Data Primary Point of Contact
|
|
|
|
POCs identified for required responsibilities
|
|
|
See below |
Chief Data Officer (if applicable)
|
|
|
thottungal.robin@epa.gov |
| Status |
Indicator |
Automated Metrics |
|
|
Overall Progress this Milestone
|
|
|
|
Provided narrative evidence of open data impacts for this quarter
|
|
|
|
Digital Analytics Program on /data
|
|
|
2154 |
Views on data.gov for this quarter
|
|
|
378.7% |
Percentage growth in views on data.gov for this quarter
|
|
|
|
Views on agency /data page for this quarter
|
|
These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot