Department of State

http://www.state.gov

Milestone 8 - August 31st 2015

OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.

Leading Indicators

These indicators are reviewed by the Office of Management and Budget

Review Status complete
Reviewer Justin Grimes
Last Updated October 6, 2015, 10:16 am EDT by Justin Grimes

Assessment Summary

Little to no progress this quarter.

Steps for improving: 1) Increase link quality; check decrease in datasets in PDL 2) Make EDI available and discoverable; Disclose public EDI per FOIA request and IDC requirements 3) Take concrete steps to improve public engagement

Public Dataset Status

Dataset Link Quality

Status Indicator Automated Metrics
Overall Progress this Milestone
Inventory Updated this Quarter
Number of Datasets
Number of APIs
Schedule Delivered
1 Bureaus represented
2 Programs represented
Number of public datasets
Number of restricted public datasets
Number of non-public datasets
Inventory > Public listing
Percentage growth in records since last quarter
Spot Check - datasets listed by search engine
Agency provides a public Enterprise Data Inventory on Data.gov
License specified
Status Indicator Automated Metrics
Overall Progress this Milestone
93 Number of Datasets
Number of Collections
93 Number of Public Datasets with File Downloads
Number of APIs
186 Total number of access and download links
Quality Check: Links are sufficiently working
142 Quality Check: Accessible links
40 Quality Check: Redirected links
Quality Check: Error links
2 Quality Check: Broken links
-20% Percentage growth in records since last quarter
100% Valid Metadata
/data exists
/data.json
Harvested by data.gov
197 Views on data.gov for the quarter
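
The link-quality rows above group each access and download link by the outcome of a crawl. As a rough sketch of how such a tally could be produced, the Python below buckets HTTP status codes into the four categories named in the table. The bucket boundaries (2xx accessible, 3xx redirected, any other status an error, no response at all broken) are an assumption for illustration; the dashboard's actual rules are not stated on this page, and the sample statuses are hypothetical.

```python
from collections import Counter
from typing import Iterable, Optional


def classify_link(status: Optional[int]) -> str:
    """Map an HTTP status code (or None for no response) to a bucket.

    The boundaries are assumed for illustration, not the dashboard's rules.
    """
    if status is None:
        return "broken"
    if 200 <= status < 300:
        return "accessible"
    if 300 <= status < 400:
        return "redirected"
    return "error"


def tally_links(statuses: Iterable[Optional[int]]) -> Counter:
    """Count how many links fall into each bucket."""
    return Counter(classify_link(s) for s in statuses)


# Hypothetical crawl results, not taken from the State Department crawl.
sample_statuses = [200, 200, 301, 404, None, 302, 200]
sample_tally = tally_links(sample_statuses)
print(dict(sample_tally))
```
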
Status Indicator Automated Metrics
Overall Progress this Milestone
Description of feedback mechanism delivered
Data release is prioritized through public engagement
Feedback loop is closed, 2 way communication
Link to or description of Feedback Mechanism
Status Indicator Automated Metrics
Overall Progress this Milestone
Data Publication Process Delivered
Information that should not be made public is documented with agency's OGC
Status Indicator Automated Metrics
Overall Progress this Milestone
BlakelySE@state.gov Open Data Primary Point of Contact
POCs identified for required responsibilities
Status Indicator Automated Metrics
Overall Progress this Milestone
Identified 5 data improvements for this quarter
See below Primary Uses
The Department is currently developing a process to address all Project Open Data requirements. The Enterprise Data Inventory (EDI) will be updated based on revisions made to the Open Data Policy. Additional enhancements to our data capture system are being made in order to comply with the new guidance received for the FY 2015 Q4 IDC.
1. We are presently developing a process to capture the dataset owners’ “license” and “rights” fields. The EDI will be updated based on revisions made to the Open Data Policy. The existing data capture system is receiving additional enhancements that will ensure system compliance with the new guidance received for the FY 2015 Q4 IDC.
2. The data assets made public as of June 30, 2015, are not part of another dataset. We are updating the process and our software tool to make sure that all released data assets will include “identifier” and “isPartOf” fields if they include individual datasets.
3. In order to enhance our ability to promote improvements that provide greater informational usefulness and public transparency, we are currently developing a bi-directional customer engagement process and survey tools to collect metadata from organizations that access the Department’s information assets published at www.state.gov.
Value or impact of data
Primary data discovery channels
User suggestions on improving data usability
User suggestions on additional data releases
Digital Analytics Program on /data
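
The improvement items above mention capturing the "license", "rights", "identifier", and "isPartOf" fields. A minimal sketch of a completeness check for those fields might look like the following. The function name, the report keys, and the sample records are hypothetical, and the rule that "rights" is expected only when a dataset is not fully public is an assumption about how the field is used.

```python
from typing import Dict, List


def check_fields(datasets: List[dict]) -> Dict[str, List[str]]:
    """Report datasets with missing or inconsistent metadata fields."""
    identifiers = {d["identifier"] for d in datasets if d.get("identifier")}
    report: Dict[str, List[str]] = {
        "missing_identifier": [],
        "missing_license": [],
        "missing_rights": [],
        "dangling_isPartOf": [],
    }
    for d in datasets:
        label = d.get("identifier") or d.get("title", "<untitled>")
        if not d.get("identifier"):
            report["missing_identifier"].append(label)
        if not d.get("license"):
            report["missing_license"].append(label)
        # Assumption: rights is only expected for non-public access levels.
        if d.get("accessLevel", "public") != "public" and not d.get("rights"):
            report["missing_rights"].append(label)
        # isPartOf should point at an identifier that exists in the catalog.
        parent = d.get("isPartOf")
        if parent and parent not in identifiers:
            report["dangling_isPartOf"].append(label)
    return report


# Hypothetical records for illustration only.
sample_datasets = [
    {"identifier": "ds-001", "title": "Parent collection",
     "license": "https://example.org/license", "accessLevel": "public"},
    {"identifier": "ds-002", "title": "Child dataset",
     "isPartOf": "ds-001", "accessLevel": "public"},
    {"title": "No id", "isPartOf": "ds-999",
     "license": "https://example.org/license",
     "accessLevel": "restricted public"},
]
sample_report = check_fields(sample_datasets)
print(sample_report)
```
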

Automated Metrics

These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot

data.json
Expected Data.json URL http://www.state.gov/data.json (From USA.gov Directory)
Resolved Data.json URL http://www.state.gov/data.json
Number of Redirects
HTTP Status 200
Content Type application/json
Valid JSON Valid
Detected Data.json Schema federal-v1.1
Datasets with Valid Metadata 100% (93 of 93)
Valid Schema Valid
Datasets 93
Datasets with Distribution URLs 100% (93 of 93)
Datasets with Download URLs 100% (93 of 93)
Total Distribution URLs 186
Total Download URLs 93
Total APIs 0
Public Datasets 93
Restricted Public Datasets 0
Non-public Datasets 0
Bureaus Represented 1
Programs Represented 2
License Specified 0.0% (0 of 93)
Datasets with Redactions 0.0% (0 of 93)
Redactions without explanation (rights field) 0.0% (0 of 93)
File Size 223.70KB
Last modified Sunday, 30-Aug-2015 16:00:05 EDT
Last crawl Monday, 31-Aug-2015 00:01:47 EDT
/data page
Expected /data URL http://www.state.gov/data (From USA.gov Directory)
Resolved /data URL http://www.state.gov/data/
Redirects 1 redirects
HTTP Status 200
Content Type text/html
Last modified Sunday, 30-Aug-2015 03:10:19 EDT
Last crawl Monday, 31-Aug-2015 00:01:46 EDT
/digitalstrategy.json
Expected /digitalstrategy.json URL http://www.state.gov/digitalstrategy.json (From USA.gov Directory)
Resolved /digitalstrategy.json URL http://www.state.gov/digitalstrategy.json
Redirects
HTTP Status 200
Content Type application/json
Valid JSON Valid
Last modified Sunday, 30-Aug-2015 16:00:02 EDT
Last crawl Monday, 31-Aug-2015 00:01:46 EDT
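
Several of the data.json figures above (dataset counts by access level, URL counts, bureaus and programs represented, license coverage) are simple aggregates over the catalog. The sketch below shows one way such a summary could be computed from a parsed data.json document. It is not the dashboard's code: the counting rule for "distribution URLs" (access URLs plus download URLs) is an assumption, and the sample catalog and its codes are hypothetical placeholders.

```python
from collections import Counter


def summarize_catalog(catalog: dict) -> dict:
    """Compute headline counts from a parsed data.json catalog."""
    datasets = catalog.get("dataset", [])
    access = Counter(d.get("accessLevel", "unspecified") for d in datasets)
    access_urls = 0
    download_urls = 0
    with_license = 0
    bureaus, programs = set(), set()
    for d in datasets:
        for dist in d.get("distribution", []):
            if dist.get("accessURL"):
                access_urls += 1
            if dist.get("downloadURL"):
                download_urls += 1
        bureaus.update(d.get("bureauCode", []))
        programs.update(d.get("programCode", []))
        if d.get("license"):
            with_license += 1
    total = len(datasets)
    return {
        "datasets": total,
        "public": access["public"],
        "restricted_public": access["restricted public"],
        "non_public": access["non-public"],
        # Assumed rule: every access or download URL counts once.
        "distribution_urls": access_urls + download_urls,
        "download_urls": download_urls,
        "bureaus": len(bureaus),
        "programs": len(programs),
        "license_pct": round(100 * with_license / total, 1) if total else 0.0,
    }


# Hypothetical two-dataset catalog; codes and URLs are placeholders.
sample_catalog = {
    "dataset": [
        {"accessLevel": "public", "license": "https://example.org/license",
         "bureauCode": ["000:00"], "programCode": ["000:000"],
         "distribution": [{"accessURL": "https://example.org/a",
                           "downloadURL": "https://example.org/a.csv"}]},
        {"accessLevel": "public",
         "bureauCode": ["000:00"], "programCode": ["000:001"],
         "distribution": [{"accessURL": "https://example.org/b"}]},
    ]
}
summary = summarize_catalog(sample_catalog)
print(summary)
```
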
Digital Strategy

Date specified: Tuesday, 12-Nov-2013 14:16:11 EST

Date of digitalstrategy.json file: Sunday, 30-Aug-2015 16:00:02 EDT

1.2.4 Develop Data Inventory Schedule - Summary

Summarize the Inventory Schedule


Milestone 1 / Initial Delivery / November 30, 2013
  Number of datasets:  113
  Open Datasets:  99
Milestone 2 / 1st Quarterly Update / February 28, 2014
  Datasets Expanded:  36  (149 total datasets)
  Datasets Enriched:   18
  Datasets Open:  9  (108 total open datasets)
Milestone 3 / 2nd Quarterly Update / May 31, 2014
  Datasets Expanded:  72  (221 total datasets)
  Datasets Enriched:  18
  Datasets Open:  9  (117 total open datasets)
Milestone 4 / 3rd Quarterly Update / August 30, 2014
  Datasets Expanded:  72    (293 total datasets)
  Datasets Enriched:   36
  Datasets Open:  18  (126 total open datasets)
Milestone 5 / 4th Quarterly Update / November 30, 2014
  Datasets Expanded:  72    (365 total datasets)
  Datasets Enriched:   36
  Datasets Open:  18  (144 total open datasets)
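
Each update in the schedule above states an increment and a cumulative total, so the arithmetic can be checked mechanically. The following sketch compares every stated total with the previous stated total plus the increment. The figures are copied from the schedule; the tuple layout and the function name are illustrative choices.

```python
# (update, datasets added, stated total, datasets opened, stated open total)
schedule = [
    ("Initial Delivery", 113, 113, 99, 99),
    ("1st Quarterly Update", 36, 149, 9, 108),
    ("2nd Quarterly Update", 72, 221, 9, 117),
    ("3rd Quarterly Update", 72, 293, 18, 126),
    ("4th Quarterly Update", 72, 365, 18, 144),
]


def find_mismatches(rows):
    """Return rows whose stated total differs from previous total + increment."""
    mismatches = []
    prev_total = prev_open = 0
    for name, added, stated_total, opened, stated_open in rows:
        if prev_total + added != stated_total:
            mismatches.append((name, "total", prev_total + added, stated_total))
        if prev_open + opened != stated_open:
            mismatches.append((name, "open", prev_open + opened, stated_open))
        # Carry the stated figures forward so one discrepancy is not repeated.
        prev_total, prev_open = stated_total, stated_open
    return mismatches


mismatches = find_mismatches(schedule)
for name, kind, expected, stated in mismatches:
    print(f"{name}: {kind} expected {expected}, stated {stated}")
```

With the figures as listed, the only row flagged is the 3rd Quarterly Update, where 117 + 18 gives 135 open datasets rather than the stated 126. The schedule does not say which of the two figures is the intended one.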

1.2.5 Develop Data Inventory Schedule - Milestones

Title: Initial Delivery
Description: The initial delivery of the Open Data Plan, the Schedule, the Enterprise Data Inventory and the Public Data Listing
Milestone Date: November 30, 2013
Description of how this milestone expands the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Number of datasets: 113
Description of how this milestone enriches the Inventory: N/A
Description of how this milestone opens the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Open Datasets: 99

Title: 1st Quarterly Update
Description: Update Open Data Plan, Schedule, Enterprise Data Inventory and Public Data Listing
Milestone Date: February 28, 2014
Description of how this milestone expands the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Datasets Expanded: 36 (149 total datasets)
Description of how this milestone enriches the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Datasets Enriched: 18
Description of how this milestone opens the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Datasets Open: 9 (108 total open datasets)

Title: 2nd Quarterly Update
Description: Update Open Data Plan, Schedule, Enterprise Data Inventory and Public Data Listing
Milestone Date: May 31, 2014
Description of how this milestone expands the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Number of datasets: 72 (221 total datasets)
Description of how this milestone enriches the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Datasets Enriched: 18
Description of how this milestone opens the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Open Datasets: 9 (117 total open datasets)

Title: 3rd Quarterly Update
Description: Update Open Data Plan, Schedule, Enterprise Data Inventory and Public Data Listing
Milestone Date: August 30, 2014
Description of how this milestone expands the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Number of datasets: 72 (293 total datasets)
Description of how this milestone enriches the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Datasets Enriched: 36
Description of how this milestone opens the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Open Datasets: 18 (126 total open datasets)

Title: 4th Quarterly Update
Description: Update Open Data Plan, Schedule, Enterprise Data Inventory and Public Data Listing
Milestone Date: November 30, 2014
Description of how this milestone expands the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Number of datasets: 72 (365 total datasets)
Description of how this milestone enriches the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Datasets Enriched: 36
Description of how this milestone opens the Inventory: See paragraph 1.2.7 Develop Data Publication Process, Open Datasets: 18 (144 total open datasets)

1.2.6 Develop Customer Feedback Process

Describe the agency's process to engage with customers


Identifying and engaging with key data customers to help determine the value of federal data assets can help agencies prioritize those of highest value for quickest release. Customers will be engaged through blog entries, email, forms on the www.State.gov/open web page, and other means as appropriate. Customers include public as well as government stakeholders. Internal customers will use blogs, email and Corridor (the Department social media site) to interact with data owners directly. The Department will evaluate public and private input and reflect on how to incorporate it into its data management practices. The Department will regularly review its evolving customer feedback and public engagement strategy and develop criteria for prioritizing the opening of data assets, accounting for factors such as the quantity and quality of user demand, internal management priorities, and agency mission relevance.

1.2.7 Develop Data Publication Process

Describe the agency's data publication process


The System Owner (new or existing system) will identify all key data sets that can be created and published. The System Owner captures the core metadata information about the data set in iMatrix. The extended metadata, such as record layout or permissible values, is entered into the Enterprise Metadata Registry. When entering the metadata, the System Owner consults with the Data Steward about the correct categorization of the data: public, restricted public, or non-public. Legal will have the responsibility to make the final determination of whether the data can be open. The iMatrix system owner will designate a user who will perform the metadata extraction process on the EDI and subsequently process the data into a JSON file. The JSON file will be published on the www.state.gov/data page. This process will be done periodically, and not less than quarterly at the start.

Every quarter the Department will target specific bureaus/offices and IT systems within its portfolio to reach out and communicate the Open Data Policy and obtain the datasets that they are currently producing. The list of the datasets will be made available through the Enterprise Data Inventory. Once a dataset is initially entered, the dataset owner will be responsible for the update and maintenance of the dataset and the associated metadata.
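
The extraction step in the publication process (taking inventory records, keeping those cleared for release, and writing a JSON file) could be sketched roughly as follows. This is an illustration under stated assumptions, not the Department's tooling: the iMatrix export format is not described here, so the record layout, the `legal_cleared` flag, and the function name are all hypothetical, as is the choice to publish only records categorized as public.

```python
import json
from typing import List


def build_public_listing(edi_records: List[dict]) -> str:
    """Serialize the publishable subset of inventory records as JSON.

    Assumptions: each record carries an "accessLevel" category and a
    hypothetical "legal_cleared" flag recording Legal's determination.
    """
    public = []
    for record in edi_records:
        if record.get("accessLevel") == "public" and record.get("legal_cleared"):
            # Drop the internal flag so it does not appear in the output.
            entry = {k: v for k, v in record.items() if k != "legal_cleared"}
            public.append(entry)
    return json.dumps({"dataset": public}, indent=2)


# Hypothetical inventory records for illustration only.
sample_edi = [
    {"title": "Dataset A", "accessLevel": "public", "legal_cleared": True},
    {"title": "Dataset B", "accessLevel": "public", "legal_cleared": False},
    {"title": "Dataset C", "accessLevel": "non-public", "legal_cleared": True},
]
listing_json = build_public_listing(sample_edi)
print(listing_json)
```

Keeping the filter separate from the file write makes it easier to review what would be published before anything is placed on the public page.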