General Services Administration

http://www.gsa.gov/

Milestone 15 - May 31st 2017

OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.

Leading Indicators

These indicators are reviewed by the Office of Management and Budget

Review Status complete
Reviewer Rebecca Williams
Last Updated December 5, 2017, 9:09 am EST by Rebecca Williams

Assessment Summary

Inventory Composition

Public Dataset Status

Dataset Link Quality

Status Indicator Automated Metrics
Overall Progress this Milestone
Inventory Updated this Quarter
220 Number of Datasets
20 Number of APIs
4 Bureaus represented
100.0% Percentage of bureaus represented
18 Programs represented
78.3% Percentage of programs represented
204 Number of public datasets
1 Number of restricted public datasets
15 Number of non-public datasets
Percentage growth in records since last quarter
To a very great extent (>75%) To what extent is your agency’s Enterprise Data Inventory (EDI) complete?
See below What steps have you taken to ensure your Enterprise Data Inventory is complete
We continue to focus on adherence to the OMB Memorandum M-13-13 Open Data PolicyManaging Information as an Asset, and meeting all required due dates. We strive to develop a clear and comprehensive understanding of the data assets we possess, accounting for all data assets created or collected. All requests (updates/corrections/additions/deletions) from our data asset owners are tracked and completed within 24 hours of receiving the request. Previously, the GSA Open Data team would manually run a harvest job so that requests received from data owners were completed on the same day. With the recent migration to a new infrastructure, if ad hoc needs arise (e.g. to conduct a manual harvest of GSA data.json outside of the usual nightly harvest schedule), we can contact the Data.gov team at datagov@gsa.gov. Our focus continues to be the quality of our datasets and verifying that we have accurate and current contact information for each of our datasets. We continue our work to directly pull data from its original source and to bring all datasets into compliance. Our goal for Open Data remains quality over quantity. We are striving to publish datasets that provide current, meaningful, and useful data to the public and other federal agencies. We plan to work with data owners to pull data directly from the source. We continue to monitor and ensure that our EDI is accurate and complete and shall continue to work on adding datasets and APIs to our inventory.
Agency provides a public Enterprise Data Inventory on Data.gov
Agency provided updated Enterprise Data Inventory to OMB
100.0% License specified Crawl details
Number of datasets with redactions
0.0% Percent of datasets with redactions
Status Indicator Automated Metrics
Overall Progress this Milestone
220 Number of Datasets Crawl details
9 Number of Collections Crawl details
108 Number of datasets not contained in a collection Crawl details
198 Number of Public Datasets with File Downloads Crawl details
20 Number of APIs Crawl details
19 Number of public APIs Crawl details
1 Number of restricted public APIs Crawl details
Number of non-public APIs Crawl details
210 Total number of access and download links Crawl details
Quality Check: Links are sufficiently working Crawl details
156 Quality Check: Accessible links Crawl details
41 Quality Check: Redirected links Crawl details
Quality Check: Error links Crawl details
10 Quality Check: Broken links Crawl details
72.4% Quality Check: Percentage of download links in correct format as specified in metadata Crawl details
1.3% Quality Check: Percentage of download links in HTML Crawl details
1.9% Quality Check: Percentage of download links in PDF Crawl details
Percentage growth in records since last quarter
100% Valid Metadata Crawl details
/data exists Crawl details
Provides datasets in human-readable form on /data
/data.json Crawl details
Harvested by data.gov
204 Number of public datasets Crawl details
1 Number of restricted public datasets Crawl details
15 Number of non-public datasets Crawl details
Percent growth of public datasets
Percent growth of restricted public datasets
Percent growth of non-public datasets
Percent datasets licensed as U.S. Public Domain
Percent datasets licensed as Creative Commons Zero
Percent datasets with other licenses
Percent datasets with no license
Status Indicator Automated Metrics
Overall Progress this Milestone
Description of feedback mechanism delivered Crawl details
Data release is prioritized through public engagement
Provided narrative evidence of data improvements based on public feedback this quarter
Feedback loop is closed, 2 way communication
See below Link to or description of Feedback Mechanism
http://www.gsa.gov/portal/content/140871
Provides valid contact point information for all datasets
Status Indicator Automated Metrics
Overall Progress this Milestone
Data Publication Process Delivered Crawl details
Information that should not to be made public is documented with agency's OGC
See below Describe the agency's data publication process
\r\nInternal Clearance Processing\r\n\r\nThe EIDM team met with representatives of the Office of General Counsel (OGC), the Freedom of Information Access Office (FOIA) and the GSA Privacy Officer to develop an internal clearance process for GSA datasets prior to their release. We agreed that our goal is proactive disclosure of datasets but we will ensure that the clearance process has risk mitigation included. The process that has been implemented includes:\r\n\u25cf\tProgram Manager, the data owner, will approve the dataset and metadata for public release and seek approval from their management. Associate Administrator within the SSO will provide approval to the SSO Portfolio Data Manager (PDM)\r\n\u25cf\tIf the dataset is new and has never been publicly published, the PDM will provide the metadata and dataset to Executive Secretariat office for entry into GSA\u2019s internal correspondence routing system known as IQ.\r\n\u25cf\tFOIA Officer will forward to OGC through IQ. OGC will coordinate with the GSA Privacy Officer and Executive Secretariat.\r\n\u25cf\tIf OGC approves the dataset for release, then the PDM is notified by OGC. OGC closes out the IQ\/Executive Secretariat entry.\r\n\u25cf\tThe PDM will prepare the datasets for release and notify the SSO data owners and EIDM.\r\n\u25cf\tIf there were issues in the course of review, then the concerns will be routed back through the IQ\/Executive Secretariat system by the FOIA or OGC to the PDM to respond to the issue.\r\n\r\nAccess Level Determination\r\n\r\nThrough consultation and coordination with the GSA FOIA office, OGC and Chief Privacy Officer, the decision was made to first use the GSA FOIA exceptions as a basis for initial access level determination. These FOIA exemptions are consistent governmentwide, not solely a GSA policy. Additional privacy analysis is performed within the FOIA, OGC and Chief Privacy offices to ensure PII is not disclosed through the release of a data asset, and that the \u201cmosaic effect\u201d will not create additional security and privacy concerns. The reviewers will document in the Enterprise Data Inventory the reasons for the restricted and private access level determinations.\r\n\u2003\r\n
Status Indicator Automated Metrics
Overall Progress this Milestone
See below Open Data Primary Point of Contact
Joseph Castle; joseph.castle@gsa.gov
POCs identified for required responsibilities
Chief Data Officer (if applicable)
Status Indicator Automated Metrics
Overall Progress this Milestone
Provided narrative evidence of open data impacts for this quarter
Digital Analytics Program on /data
Views on data.gov for this quarter
Percentage growth in views on data.gov for this quarter
Views on agency /data page for this quarter

Automated Metrics

These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot

data.json
Expected Data.json URL http://www.gsa.gov/data.json (From USA.gov Directory)
Resolved Data.json URL https://open.gsa.gov/data.json
Number of Redirects 3 redirects
HTTP Status 200
Content Type application/json
Valid JSON Valid
Detected Data.json Schema federal-v1.1
Datasets with Valid Metadata 100%(220 of 220)
Valid Schema Valid
Datasets 220
Number of Collections 9
Number of datasets not in a collection 108
Datasets with Distribution URLs 90.0% (198 of 220)
Datasets with Download URLs 71.4% (157 of 220)
Total Distribution URLs 210 (but only 156 accessible)
Total Download URLs 169
Total APIs 20
Public APIs 19
Restricted Public APIs 1
Non-public APIs 0
Public Datasets 204
Restricted Public Datasets 1
Non-public Datasets 15
Server Not Found 1.4% (3 of 210)
Working links (HTTP 2xx)
Broken links (HTTP 4xx)
Error Links (HTTP 5xx)
Redirected Links (HTTP 3xx)
Correct format
PDF for raw data
HTML for raw data
Bureaus Represented 5
Programs Represented 18
License Specified 100% (220 of 220)
Datasets with Redactions 0.0% (0 of 220)
Redactions without explanation (rights field) 0.0% (0 of 220)
File Size 332.15KB
Last modified Monday, 22-May-2017 14:59:13 EDT
Last crawl Wednesday, 31-May-2017 03:33:37 EDT
Analyze archive copies Analyze archive from 2017-05-31
Nearby Daily Crawls
/data page
Expected /data URL http://www.gsa.gov/data (From USA.gov Directory)
Resolved /data URL https://www.gsa.gov/portal/category/105839
Redirects 2 redirects
HTTP Status 200
Content Type text/html;charset=UTF-8
Last crawl Wednesday, 31-May-2017 03:31:55 EDT
/digitalstrategy.json
Expected /digitalstrategy.json URL http://www.gsa.gov/digitalstrategy.json (From USA.gov Directory)
Resolved /digitalstrategy.json URL https://www.gsa.gov/digitalstrategy.json
Redirects 1 redirects
HTTP Status 200
Content Type application/json
Valid JSON Invalid Check a JSON Validator
Last modified Tuesday, 10-Dec-2013 16:50:09 EST
Last crawl Wednesday, 31-May-2017 03:31:56 EDT