General Services Administration

http://www.gsa.gov/

Milestone 11 - May 31st 2016

OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.

Leading Indicators

These indicators are reviewed by the Office of Management and Budget

Review Status complete
Reviewer Bryant Renaud
Last Updated August 17, 2016, 4:10 pm EDT by Bryant Renaud

Assessment Summary

EDI is Yellow: some datasets are missing licensing information.

Human Capital is Yellow: ensure that a single primary open data contact is provided (in IDC).

Inventory Composition

Public Dataset Status

Dataset Link Quality

Status Indicator Automated Metrics
Overall Progress this Milestone
Inventory Updated this Quarter
206 Number of Datasets
22 Number of APIs
5 Bureaus represented
100% Percentage of bureaus represented
18 Programs represented
78.3% Percentage of programs represented
190 Number of public datasets
1 Number of restricted public datasets
15 Number of non-public datasets
15.7% Percentage growth in records since last quarter
To a very great extent (>75%) To what extent is your agency’s Enterprise Data Inventory (EDI) complete?
See below What steps have you taken to ensure your Enterprise Data Inventory is complete?
To ensure that our Enterprise Data Inventory (EDI) is complete, we continuously monitor our datasets for accuracy of all metadata. All updates and corrections received from Data Owners are implemented the same day so that the data stays current and available. We have created and added to our Data Page a human-readable list of all our datasets. A link on the Data Page lets visitors view all available datasets, their access level, the file format, a brief description of each dataset, and the POC for each dataset. We are working with the Working Data Management Group, Data Stewards, and the D2D team on a new initiative to remove duplicate datasets, identify Data Stewards for our datasets, and verify that the datasets comply with the guidelines in the Project Open Data documentation. We plan to meet with our Data Stewards to discuss any identified dataset issues, such as incorrect file formats, missing metadata, and incorrect POCs, and to determine whether each dataset is providing information to the public and other Federal agencies. Our goal continues to be improving our existing datasets, adding new datasets that adhere to our established standards, and meeting all stated requirements regarding Open Data.
Agency provides a public Enterprise Data Inventory on Data.gov
Agency provided updated Enterprise Data Inventory to OMB
97.6% License specified
Number of datasets with redactions
Percent of datasets with redactions
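As a rough illustration of how a quarter-over-quarter growth figure such as the 15.7% above can be derived, the Python sketch below applies the usual percent-change formula. The function name `percent_growth` is made up for this example, and the previous-quarter count of 178 is a hypothetical value chosen to be consistent with the reported percentage; the report itself does not state it.

```python
def percent_growth(previous: int, current: int) -> float:
    """Return growth from previous to current as a percentage, rounded to one decimal."""
    if previous <= 0:
        raise ValueError("previous count must be positive")
    return round((current - previous) / previous * 100, 1)


# Hypothetical previous-quarter record count (assumption, not taken from the report).
previous_total = 178
# Current record count, taken from "Number of Datasets" above.
current_total = 206

print(percent_growth(previous_total, current_total))  # 15.7
```

The same function would apply to the per-access-level growth figures, provided the earlier counts are available from a prior milestone snapshot.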

Best Practice: General Services Administration has been highlighted for demonstrating a best practice on the Public Data Listing indicator

Status Indicator Automated Metrics
Overall Progress this Milestone
206 Number of Datasets
9 Number of Collections
107 Number of datasets not contained in a collection
183 Number of Public Datasets with File Downloads
22 Number of APIs
21 Number of public APIs
1 Number of restricted public APIs
Number of non-public APIs
194 Total number of access and download links
Quality Check: Links are sufficiently working
168 Quality Check: Accessible links
15 Quality Check: Redirected links
Quality Check: Error links
8 Quality Check: Broken links
76.8 Quality Check: Percentage of download links in correct format as specified in metadata
5.4 Quality Check: Percentage of download links in HTML
4.2 Quality Check: Percentage of download links in PDF
15.7 Percentage growth in records since last quarter
100% Valid Metadata
/data exists
Provides datasets in human-readable form on /data
/data.json
Harvested by data.gov
190 Number of public datasets
1 Number of restricted public datasets
15 Number of non-public datasets
15.2 Percent growth of public datasets
Percent growth of restricted public datasets
25 Percent growth of non-public datasets
7.3 Percent datasets licensed as U.S. Public Domain
88.8 Percent datasets licensed as Creative Commons Zero
1.5 Percent datasets with other licenses
2.4 Percent datasets with no license
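Counts like the access-level and license figures above can in principle be reproduced from an agency's data.json file. The following is a minimal sketch, assuming the catalog follows the Project Open Data metadata schema v1.1 (a top-level `dataset` array whose entries carry `accessLevel` and `license` fields). The function name `summarize_catalog` and the sample catalog are invented for illustration, and the dashboard's own crawler may compute its metrics differently.

```python
from collections import Counter


def summarize_catalog(catalog: dict) -> dict:
    """Count datasets by access level and report the share with a license set."""
    datasets = catalog.get("dataset", [])
    total = len(datasets)
    # accessLevel is expected to be "public", "restricted public", or "non-public".
    access = Counter(d.get("accessLevel", "unknown") for d in datasets)
    # Treat a missing or empty license field as "no license".
    licensed = sum(1 for d in datasets if d.get("license"))
    percent_licensed = round(licensed / total * 100, 1) if total else 0.0
    return {
        "total": total,
        "access_levels": dict(access),
        "licensed": licensed,
        "percent_licensed": percent_licensed,
    }


# Tiny made-up catalog; a real one would be loaded from the agency's data.json.
sample_catalog = {
    "dataset": [
        {"accessLevel": "public", "license": "https://creativecommons.org/publicdomain/zero/1.0/"},
        {"accessLevel": "public", "license": "http://www.usa.gov/publicdomain/label/1.0/"},
        {"accessLevel": "restricted public", "license": ""},
        {"accessLevel": "non-public"},
    ]
}

print(summarize_catalog(sample_catalog))
```

Grouping licensed datasets into categories such as U.S. Public Domain or Creative Commons Zero would additionally require matching the license URLs, which this sketch leaves out.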
Status Indicator Automated Metrics
Overall Progress this Milestone
Description of feedback mechanism delivered
Data release is prioritized through public engagement
Provided narrative evidence of data improvements based on public feedback this quarter
Feedback loop is closed, 2 way communication
See below Link to or description of Feedback Mechanism
http://www.gsa.gov/portal/content/140871
Provides valid contact point information for all datasets
Status Indicator Automated Metrics
Overall Progress this Milestone
Data Publication Process Delivered
Information that should not be made public is documented with agency's OGC
See below Describe the agency's data publication process
Internal Clearance Processing
The EIDM team met with representatives of the Office of General Counsel (OGC), the Freedom of Information Act (FOIA) Office, and the GSA Privacy Officer to develop an internal clearance process for GSA datasets prior to their release. We agreed that our goal is proactive disclosure of datasets, but we will ensure that the clearance process includes risk mitigation. The process that has been implemented includes:
● The Program Manager, as the data owner, will approve the dataset and metadata for public release and seek approval from their management. The Associate Administrator within the SSO will provide approval to the SSO Portfolio Data Manager (PDM).
● If the dataset is new and has never been published, the PDM will provide the metadata and dataset to the Executive Secretariat office for entry into GSA’s internal correspondence routing system, known as IQ.
● The FOIA Officer will forward the dataset to OGC through IQ. OGC will coordinate with the GSA Privacy Officer and Executive Secretariat.
● If OGC approves the dataset for release, OGC notifies the PDM and closes out the IQ/Executive Secretariat entry.
● The PDM will prepare the datasets for release and notify the SSO data owners and EIDM.
● If issues arise in the course of review, FOIA or OGC will route the concerns back through the IQ/Executive Secretariat system to the PDM, who will respond to them.
Access Level Determination
Through consultation and coordination with the GSA FOIA office, OGC, and the Chief Privacy Officer, the decision was made to first use the FOIA exemptions as the basis for initial access level determination. These FOIA exemptions are consistent governmentwide, not solely a GSA policy. Additional privacy analysis is performed within the FOIA, OGC, and Chief Privacy offices to ensure that PII is not disclosed through the release of a data asset, and that the “mosaic effect” will not create additional security and privacy concerns.
The reviewers will document in the Enterprise Data Inventory the reasons for the restricted and private access level determinations.
Status Indicator Automated Metrics
Overall Progress this Milestone
See below Open Data Primary Point of Contact
joseph.castle@gsa.gov, kevin.wince@gsa.gov, jonah.hatfield@gsa.gov
POCs identified for required responsibilities
See below Chief Data Officer (if applicable)
joseph.castle@gsa.gov, kevin.wince@gsa.gov, jonah.hatfield@gsa.gov
Status Indicator Automated Metrics
Overall Progress this Milestone
Provided narrative evidence of open data impacts for this quarter
Digital Analytics Program on /data
364 Views on data.gov for this quarter
5.8% Percentage growth in views on data.gov for this quarter
387 Views on agency /data page for this quarter

Automated Metrics

These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot

data.json
Expected Data.json URL http://www.gsa.gov/data.json (From USA.gov Directory)
Resolved Data.json URL http://open.gsa.gov/data.json
Number of Redirects 1 redirect
HTTP Status 200
Content Type application/json
Valid JSON Valid
Detected Data.json Schema federal-v1.1
Datasets with Valid Metadata 100% (206 of 206)
Valid Schema Valid
Datasets 206
Number of Collections 9
Number of datasets not in a collection 107
Datasets with Distribution URLs 88.8% (183 of 206)
Datasets with Download URLs 68.4% (141 of 206)
Total Distribution URLs 194
Total Download URLs 152
Total APIs 22
Public APIs 21
Restricted Public APIs 1
Non-public APIs 0
Public Datasets 190
Restricted Public Datasets 1
Non-public Datasets 15
Bureaus Represented 5
Programs Represented 18
License Specified 97.6% (201 of 206)
Datasets with Redactions 0.0% (0 of 206)
Redactions without explanation (rights field) 0.0% (0 of 206)
File Size 308.41KB
Last modified Thursday, 19-May-2016 10:41:00 EDT
Last crawl Tuesday, 31-May-2016 00:04:32 EDT
/data page
Expected /data URL http://www.gsa.gov/data (From USA.gov Directory)
Resolved /data URL http://www.gsa.gov/portal/category/105839
Redirects 1 redirect
HTTP Status 200
Content Type text/html;charset=UTF-8
Last crawl Tuesday, 31-May-2016 00:04:29 EDT
/digitalstrategy.json
Expected /digitalstrategy.json URL http://www.gsa.gov/digitalstrategy.json (From USA.gov Directory)
Resolved /digitalstrategy.json URL http://www.gsa.gov/digitalstrategy.json
Redirects
HTTP Status 200
Content Type application/json
Valid JSON Invalid
Last modified Tuesday, 10-Dec-2013 16:50:09 EST
Last crawl Tuesday, 31-May-2016 00:04:29 EDT
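
The crawl results above report an HTTP status, a content type, and a JSON validity flag for each file. The Python sketch below shows one way such checks could be expressed once a response has been fetched; it is an illustration only, not the dashboard's actual crawler. The function name `check_json_response` and the sample bodies are assumptions made for this example.

```python
import json


def check_json_response(status: int, content_type: str, body: str) -> dict:
    """Evaluate a fetched response the way a simple JSON crawl check might."""
    # Drop parameters such as "; charset=UTF-8" before comparing the media type.
    media_type = content_type.split(";")[0].strip().lower()
    try:
        json.loads(body)
        valid_json = True
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        valid_json = False
    return {
        "http_ok": status == 200,
        "json_content_type": media_type == "application/json",
        "valid_json": valid_json,
    }


# A response can return 200 with a JSON content type and still fail to parse,
# which matches the pattern reported for /digitalstrategy.json above.
print(check_json_response(200, "application/json", '{"dataset": ['))
print(check_json_response(200, "application/json", '{"dataset": []}'))
```

Keeping the check separate from the network request makes it easy to run against archived copies of a file as well as live responses.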