Department of Education
Enterprise Data Inventory - Volume and composition over time
M-13-13 Milestone 10 - February 29th 2016
OMB Review Complete: OMB has completed the agency review for this milestone. Agencies should contact their OMB desk officer if anything looks incorrect.
Leading Indicators
These indicators are reviewed by the Office of Management and Budget
| Review Status | complete |
|---|---|
| Reviewer | Justin Grimes |
| Last Updated | December 4, 2017, 9:47 pm EST by Rebecca Williams |
Assessment Summary
Agency fails to document any outstanding Licensing information; Agency should be applauded for College Scorecard; however the Agency does not have a transparent, two-way public feedback mechanism; only blog postings.
Inventory Composition
Public Dataset Status
Dataset Link Quality
| Status | Indicator | Automated Metrics | ||
|---|---|---|---|---|
| Overall Progress this Milestone | ||||
| Inventory Updated this Quarter | ||||
| 285 | Number of Datasets | |||
| 45 | Number of APIs | |||
| 10 | Bureaus represented | |||
| 90.9% | Percentage of bureaus represented | |||
| 1 | Programs represented | |||
| 0.9% | Percentage of programs represented | |||
| 285 | Number of public datasets | |||
| 26 | Number of restricted public datasets | |||
| 2 | Number of non-public datasets | |||
| Percentage growth in records since last quarter | ||||
| To a very great extent (>75%) | To what extent is your agency’s Enterprise Data Inventory (EDI) complete? | |||
| See below | What steps have you taken to ensure your Enterprise Data Inventory is complete | |||
| Currently, the Department of Education's Electronic Data Inventory is complete to a very great extent. Of all datasets identified to be released to the public, approximately 75 percent of those have been made available in some sort of easilyaccessible manner. Releases include providing data through data.gov and several application programming interfaces (APIs) from FSA, along with metadata through ED's Data Inventory. The release of this data has helped families and students make informed decisions about selecting a school or college. Additionally, community, nonprofits, and advocacy groups use this data to identify educational inequality, invest wisely and act on certain community, school, district, or state. Greater transparency and data availability helps all those involved make more informed, datadriven decisions for improving educational outcomes and opportunities. | ||||
| Agency provides a public Enterprise Data Inventory on Data.gov | ||||
| Agency provided updated Enterprise Data Inventory to OMB | ||||
| 0.0% | License specified | Crawl details | ||
| Number of datasets with redactions | ||||
| 0.0% | Percent of datasets with redactions | |||
| Status | Indicator | Automated Metrics |
|---|---|---|
| Overall Progress this Milestone | ||
| 311 | Number of Datasets | Crawl details |
| Number of Collections | Crawl details | |
| 311 | Number of datasets not contained in a collection | Crawl details |
| 240 | Number of Public Datasets with File Downloads | Crawl details |
| 45 | Number of APIs | Crawl details |
| Number of public APIs | Crawl details | |
| Number of restricted public APIs | Crawl details | |
| Number of non-public APIs | Crawl details | |
| 815 | Total number of access and download links | Crawl details |
| Quality Check: Links are sufficiently working | Crawl details | |
| 672 | Quality Check: Accessible links | Crawl details |
| 126 | Quality Check: Redirected links | Crawl details |
| Quality Check: Error links | Crawl details | |
| 17 | Quality Check: Broken links | Crawl details |
| 58.0% | Quality Check: Percentage of download links in correct format as specified in metadata | Crawl details |
| 0.4% | Quality Check: Percentage of download links in HTML | Crawl details |
| 3.1% | Quality Check: Percentage of download links in PDF | Crawl details |
| Percentage growth in records since last quarter | ||
| 100% | Valid Metadata | Crawl details |
| /data exists | Crawl details | |
| Provides datasets in human-readable form on /data | ||
| /data.json | Crawl details | |
| Harvested by data.gov | ||
| 283 | Number of public datasets | Crawl details |
| 26 | Number of restricted public datasets | Crawl details |
| 2 | Number of non-public datasets | Crawl details |
| Percent growth of public datasets | ||
| Percent growth of restricted public datasets | ||
| Percent growth of non-public datasets | ||
| Percent datasets licensed as U.S. Public Domain | ||
| Percent datasets licensed as Creative Commons Zero | ||
| Percent datasets with other licenses | ||
| Percent datasets with no license |
| Status | Indicator | Automated Metrics | ||
|---|---|---|---|---|
| Overall Progress this Milestone | ||||
| Description of feedback mechanism delivered | Crawl details | |||
| Data release is prioritized through public engagement | ||||
| Provided narrative evidence of data improvements based on public feedback this quarter | ||||
| Feedback loop is closed, 2 way communication | ||||
| See below | Link to or description of Feedback Mechanism | |||
| http://www2.ed.gov/digitalstrategy/index.html | ||||
| Provides valid contact point information for all datasets | ||||
| Status | Indicator | Automated Metrics | ||
|---|---|---|---|---|
| Overall Progress this Milestone | ||||
| Data Publication Process Delivered | Crawl details | |||
| Information that should not to be made public is documented with agency's OGC | ||||
| See below | Describe the agency's data publication process | |||
| Much of the information collected by the Department of Education is protected by the privacy or confidentiality provisions of one or more federal statutes. These include: The Family Educational Rights and Privacy Act (FERPA), The Individuals with Disabilities Education Act (IDEA), The Education Sciences Reform Act (ESRA), and The Privacy Act of 1974. To help ensure that the Department does not release any information that would violate the privacy of students or other individuals in violation of these statutes, the Department has implemented a disclosure review process for its public data releases. Information collections containing data protected by one or more of these statutes are reviewed and approved for release by one of the Department's disclosure review boards, prior to their release to the public. | ||||
Automated Metrics
These metrics are generated by an automated analysis that runs every 24 hours until the end of the quarter at which point they become a historical snapshot
data.json
| Expected Data.json URL | http://www.ed.gov/data.json (From USA.gov Directory) |
|---|---|
| Resolved Data.json URL | http://www2.ed.gov/data.json |
| Number of Redirects | 1 redirects |
| HTTP Status | 200 |
| Content Type | application/json |
| Valid JSON | Invalid Check a JSON Validator |
| Detected Data.json Schema | federal-v1.1 |
| Datasets with Valid Metadata | 100%(311 of 311) - The JSON file is invalid and can't be parsed without special processing |
| Valid Schema | Valid |
| Datasets | 311 |
| Number of Collections | 0 |
| Number of datasets not in a collection | 311 |
| Datasets with Distribution URLs | 77.2% (240 of 311) |
| Datasets with Download URLs | 44.4% (138 of 311) |
| Total Distribution URLs | 815 |
| Total Download URLs | 524 |
| Total APIs | 45 |
| Public Datasets | 283 |
| Restricted Public Datasets | 26 |
| Non-public Datasets | 2 |
| Normally there would be a set of quality assurance fields here to verify that the download links included within the metadata are functioning properly, but the results of those tests are not currently available. | |
| Bureaus Represented | 10 |
| Programs Represented | 1 |
| License Specified | 0.0% (0 of 311) |
| Datasets with Redactions | 0.0% (0 of 311) |
| Redactions without explanation (rights field) | 0.0% (0 of 311) |
| File Size | 1.22MB |
| Last modified | Wednesday, 06-Jan-2016 16:34:17 EST |
| Last crawl | Monday, 29-Feb-2016 23:00:59 EST |
| Analyze archive copies | Analyze archive from 2016-02-29 |
| Nearby Daily Crawls | |
| Expected /data URL | http://www.ed.gov/data (From USA.gov Directory) |
|---|---|
| Resolved /data URL | http://www2.ed.gov/rschstat/landing.jhtml |
| Redirects | 2 redirects |
| HTTP Status | 200 |
| Content Type | text/html;charset=UTF-8 |
| Last crawl | Monday, 29-Feb-2016 23:00:39 EST |
| Expected /digitalstrategy.json URL | http://www.ed.gov/digitalstrategy.json (From USA.gov Directory) |
|---|---|
| Resolved /digitalstrategy.json URL | http://www2.ed.gov/digitalstrategy.json |
| Redirects | 1 redirects |
| HTTP Status | 200 |
| Content Type | application/json |
| Valid JSON | Valid |
| Last modified | Tuesday, 01-Sep-2015 10:01:11 EDT |
| Last crawl | Monday, 29-Feb-2016 23:00:39 EST |
Date specified: Tuesday, 02-Dec-2014 08:55:34 EST
Date of digitalstrategy.json file: Tuesday, 01-Sep-2015 10:01:11 EDT1.2.4 Develop Data Inventory Schedule - Summary
Summarize the Inventory Schedule
As of November 30, 2013, the Department of Education's Data Inventory contains detailed information for 33 Data Series, and 223 component "Studies" (collections). The metadata for these data assets were taken from the Department's Paperwork Reduction Act (PRA) information collection submissions to the Office of Management and Budget and from data dictionaries or electronic codebooks where available. By November 30, 2014, the Department anticipates expanding and enriching the Data Inventory with information and metadata on additional existing collections, as resources allow. By that date, the Department will also have established and implemented new data governance policies and procedures to ensure that all new data assets are entered into the Data Inventory as part of the information collection clearance process.
1.2.5 Develop Data Inventory Schedule - Milestones
| Title | Further Development of the ED Data Inventory |
|---|---|
| Description | As of November 30, 2013, the Department of Education's Data Inventory contains detailed information for 33 Data Series, and 223 component "Studies" (collections). The metadata for these data assets were taken from the Department's Paperwork Reduction Act (PRA) information collection submissions to the Office of Management and Budget and from data dictionaries or electronic codebooks. http://datainventory.ed.gov |
| Milestone Date | November 30, 2014 |
| Description of how this milestone expands the Inventory | By November 30, 2014, the Department anticipates expanding and enriching the Data Inventory with information and metadata on additional existing collections, as resources allow. By that date, the Department will also have established and implemented new data governance policies and procedures to ensure that all new data assets are entered into the Data Inventory as part of the information collection clearance process. |
| Description of how this milestone enriches the Inventory | See the description of how this milestone expands the Inventory. More information will be posted as it becomes available. |
| Description of how this milestone opens the Inventory | See the description of the Inventory. More information will be posted as it becomes available. |
1.2.6 Develop Customer Feedback Process
Describe the agency's process to engage with customers
The Department engages with the education community and interested developers and entrepreneurs through social media outlets and the ED official Homeroom Blog. More than 150 of America's entrepreneurs, software developers, education experts, and policy makers came together October 9, 2012, for an Education Datapalooza at the White House. The gathering was a chance to celebrate new products, services, and apps--all built with freely available data from the government and other sources--that have the potential to help American students succeed and that empower students and their families to make informed educational decisions. See videos of talks from Education Datapalooza 2012 at http://www.youtube.com/playlist?list=PLhdwy3ASoEfm1QeH0kfNnLWUqv4lE1pPs. The Department held two data jams at Massachusetts Institute of Technology and Stanford University in December 2013. The Data Jams focused on data related to postsecondary education and led up to a second Education Datapalooza. For Education Datapalooza 2014 on January 15,2014, more than 600 people packed into an auditorium to discuss innovation in higher education. The gathering was a response to President Obama's call this past August to improve value and affordability in postsecondary education, in which he outlined an ambitious plan that included a major focus on innovation. Datapalooza participants gathered to celebrate innovative products, apps, websites, and other tools to help students get to and through postsecondary education. Many of the tools help students and families navigate the college choice and selection process. Others focus on improving teaching and learning, especially in ways that leverage technology to improve online and classroom-based instruction. Videos from talks from Education Datapalooza 2014 are at http://www.youtube.com/playlist?list=PLhdwy3ASoEfkNJ_7ffaaYOhQmi61oLFLX. The goal of the Education Data Jams and Datapalooza is to spark interest and collaboration among entrepreneurs and developers in creating apps and other digital tools to facilitate easier access to higher education information for prospective and current college students.
1.2.7 Develop Data Publication Process
Describe the agency's data publication process
Much of the information collected by the Department of Education is protected by the privacy or confidentiality provisions of one or more federal statutes. These include: The Family Educational Rights and Privacy Act (FERPA), The Individuals with Disabilities Education Act (IDEA), The Education Sciences Reform Act (ESRA), and The Privacy Act of 1974. To help ensure that the Department does not release any information that would violate the privacy of students or other individuals in violation of these statutes, the Department has implemented a disclosure review process for its public data releases. Information collections containing data protected by one or more of these statutes are reviewed and approved for release by one of the Department's disclosure review boards, prior to their release to the public.