2. Brainstorm & Identify Datasets

Think about dataset within each data systems

Once you have identified you data systems, the next step is to brainstorm dataset within each system.

Some of datasets may be fairly straightforward (e.g. a single sheet in a spreadsheet). But others, like relational databases, may be very complex. Identifying subsets of the database that could serve as datasets, probably requires some brainstorming. You may want to include your PIO, data stewards and lead analysts in this process.

To help brainstorm, use the questions below:

  • What data populates your monthly or quarterly reports?

  • What data does your department use for internal performance and trend analysis?

  • What data is reported to federal, state or local agencies?

  • Talk with your Public Information Officer (PIO) - what data has been requested under Sunshine?

  • What data do other departments ask for?

  • What kinds of open data are similar agencies across the country publishing?

Don’t exclude any datasets based on privacy or confidentiality concerns! Our goal is to have a holistic picture of our data. Based on this big picture, we can then decide what we should publish. Step 3 provides a means to capture privacy and confidentiality concerns.

Last updated