National Science Foundation

http://www.nsf.gov/

Milestone 11 - May 31st 2016

OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.

Leading Indicators

These indicators are reviewed by the Office of Management and Budget

Review Status complete
Reviewer Bryant Renaud
Last Updated August 17, 2016, 4:11 pm EDT by Bryant Renaud

Assessment Summary

EDI is Yellow: Fails to document outstanding Licensing information in EDI.

PDL is Yellow: Agency provides some datasets in human-readable form on agency's /data page, but not all.

Other: It seems agency may not have implemented DAP token correctly on /data page.

Inventory Composition

Public Dataset Status

Dataset Link Quality

Status Indicator Automated Metrics
Overall Progress this Milestone
Inventory Updated this Quarter
156 Number of Datasets
3 Number of APIs
1 Bureaus represented
100% Percentage of bureaus represented
2 Programs represented
14.3% Percentage of programs represented
153 Number of public datasets
1 Number of restricted public datasets
2 Number of non-public datasets
5.4% Percentage growth in records since last quarter
To a very great extent (>75%) To what extent is your agency’s Enterprise Data Inventory (EDI) complete?
See below What steps have you taken to ensure your Enterprise Data Inventory is complete
NSF appreciates the opportunity to review the quarterly action items to improve and enhance the Enterprise Data Inventory's completeness. To ensure EDI completeness each quarter, NSF conducts a review of agency data sources recently added to our webpages and coordinates with internal NSF POCs to determine if there are new datasets for potential inclusion in the updated EDI. During this process, we also look into ways in which we can improve the quality of our datasets. These data improvements include: reviewing existing datasets and related metadata and adding metadata that will improve access to content; ensuring newly identified agency data sources are added to the updated EDI; ensuring all appropriate data assets are grouped as collections; and including new datasets and/or updating metadata in the Enterprise Data Inventory to reflect the public feedback received.
Agency provides a public Enterprise Data Inventory on Data.gov
Agency provided updated Enterprise Data Inventory to OMB
90.4% License specified Crawl details
Number of datasets with redactions
Percent of datasets with redactions
Status Indicator Automated Metrics
Overall Progress this Milestone
156 Number of Datasets Crawl details
1 Number of Collections Crawl details
122 Number of datasets not contained in a collection Crawl details
153 Number of Public Datasets with File Downloads Crawl details
3 Number of APIs Crawl details
3 Number of public APIs Crawl details
Number of restricted public APIs Crawl details
Number of non-public APIs Crawl details
164 Total number of access and download links Crawl details
Quality Check: Links are sufficiently working Crawl details
157 Quality Check: Accessible links Crawl details
3 Quality Check: Redirected links Crawl details
Quality Check: Error links Crawl details
3 Quality Check: Broken links Crawl details
75.2 Quality Check: Percentage of download links in correct format as specified in metadata Crawl details
29.3 Quality Check: Percentage of download links in HTML Crawl details
13.4 Quality Check: Percentage of download links in PDF Crawl details
5.4 Percentage growth in records since last quarter
100% Valid Metadata Crawl details
/data exists Crawl details
Provides datasets in human-readable form on /data
/data.json Crawl details
Harvested by data.gov
153 Number of public datasets Crawl details
1 Number of restricted public datasets Crawl details
2 Number of non-public datasets Crawl details
5.5 Percent growth of public datasets
Percent growth of restricted public datasets
Percent growth of non-public datasets
Percent datasets licensed as U.S. Public Domain
Percent datasets licensed as Creative Commons Zero
90.4 Percent datasets with other licenses
9.6 Percent datasets with no license
Status Indicator Automated Metrics
Overall Progress this Milestone
Description of feedback mechanism delivered Crawl details
Data release is prioritized through public engagement
Provided narrative evidence of data improvements based on public feedback this quarter
Feedback loop is closed, 2 way communication
See below Link to or description of Feedback Mechanism
NSF is committed to providing twoway feedback mechanisms to inspire public engagement of open data. One example of this is the NSF Blog of the Division of Environmental Biology (https://debblog.nsfbio.com/). DEBrief provides information on NSF BIO funding opportunity updates, analysis of programs, and discussions of the merit review and decisionmaking process. On May 19, 2016, DEBrief created a blog post asking for public feedback on the content and type of information shared on the blog, whereby the public was encouraged to submit ideas in the 'Comments' section of the blog.
Provides valid contact point information for all datasets
Status Indicator Automated Metrics
Overall Progress this Milestone
Data Publication Process Delivered Crawl details
Information that should not to be made public is documented with agency's OGC
See below Describe the agency's data publication process
NSF utilizes a regular process to review newly identified data sets prior to their planned release, to validate that there are no concerns with respect to privacy, confidentiality, security, contractual restrictions, or other factors. This process is similar to the agency approval process for the publication of agency data in Data.gov and incorporates best practices from the agency’s Freedom of Information Act (FOIA) program to ensure the presumption of openness is being applied to all agency data release decisions. When a potential restriction to release is identified, agency points of contact for Open Data will work with the Office of General Counsel and other subject matter experts as appropriate to review the concerns and, if required, document the determined barrier to release. Some potential characteristics of agency data that could prevent public release include privacy considerations (e.g., personally identifiable information); confidentiality matters (e.g., predecisional or deliberative material); contractual restrictions (e.g., contractor bidding information). Because of the nature of NSF’s mission, one common restriction for public release of data would likely be limitations in the full release of proposal-related data that may contain confidential, proprietary business information protected by FOIA Exemption 4.
Status Indicator Automated Metrics
Overall Progress this Milestone
aesmith@nsf.gov Open Data Primary Point of Contact
POCs identified for required responsibilities
Chief Data Officer (if applicable)
Status Indicator Automated Metrics
Overall Progress this Milestone
Provided narrative evidence of open data impacts for this quarter
Digital Analytics Program on /data
181 Views on data.gov for this quarter
14.6% Percentage growth in views on data.gov for this quarter
Views on agency /data page for this quarter

Automated Metrics

These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot

data.json
Expected Data.json URL http://www.nsf.gov/data.json (From USA.gov Directory)
Resolved Data.json URL http://www.nsf.gov/data.json
Number of Redirects
HTTP Status 200
Content Type text/plain
Valid JSON Invalid Check a JSON Validator
Detected Data.json Schema federal-v1.1
Datasets with Valid Metadata 100%(156 of 156) - The JSON file is invalid and can't be parsed without special processing
Valid Schema Valid
Datasets 156
Number of Collections 1
Number of datasets not in a collection 122
Datasets with Distribution URLs 98.1% (153 of 156)
Datasets with Download URLs 95.5% (149 of 156)
Total Distribution URLs 164
Total Download URLs 158
Total APIs 3
Public APIs 3
Restricted Public APIs 0
Non-public APIs 0
Public Datasets 153
Restricted Public Datasets 1
Non-public Datasets 2
Bureaus Represented 1
Programs Represented 2
License Specified 90.4% (141 of 156)
Datasets with Redactions 0.0% (0 of 156)
Redactions without explanation (rights field) 0.0% (0 of 156)
File Size 171.75KB
Last crawl Tuesday, 31-May-2016 00:13:24 EDT
Analyze archive copies Analyze archive from 2016-05-31
Nearby Daily Crawls
/data page
Expected /data URL http://www.nsf.gov/data (From USA.gov Directory)
Resolved /data URL http://www.nsf.gov/data/
Redirects 1 redirects
HTTP Status 200
Content Type text/html;charset=ISO-8859-1
Last crawl Tuesday, 31-May-2016 00:13:14 EDT
/digitalstrategy.json
Expected /digitalstrategy.json URL http://www.nsf.gov/digitalstrategy.json (From USA.gov Directory)
Resolved /digitalstrategy.json URL http://www.nsf.gov/digitalstrategy.json
Redirects
HTTP Status 200
Content Type text/plain
Valid JSON Invalid Check a JSON Validator
Last crawl Tuesday, 31-May-2016 00:13:15 EDT