EPA/OW header
Monitoring Water Quality


6.1
Managing Volunteer Data

The following steps will help ensure that the data collected by volunteers are well managed, credible, and of value to potential data users.

Review Field Data Sheets

The volunteer program coordinator or designated analyst should screen and review the field data sheets as they are received. This involves some basic "reality checks." Questions that should be kept in mind include the following:

Review Information in Your Database

Once volunteer data enters a computerized database, it can take on a life of its own. It is a phenomenon of human nature that data suddenly seem more believable once computerized. Therefore, be sure to carefully screen information as soon as yo u enter it into a database. Then review a printout (preferably with a fresh pair of eyes) against the original field data sheets. One way to minimize transcription errors is to design the computer input screens to look like the field data forms.

As a further check, you can run some simple calculations like determining medians and means to make sure no errors have slipped through. (If the median and the mean are very different, an outlier may be skewing the results.) Again, if you uncover unusual data points that cannot be explained by backup information on the field data sheets or the comment field in the database, flag the data as questionable until it can be verified.

Review Your Final Results

Once volunteer monitoring data has been entered into a database, the next step is to generate reports on the findings of the data. Even at this stage you should continue to look for inconsistencies and problems. For example, you should: By the time you present your final results to your volunteers or other data users, you should feel fully confident that you have assembled the best possible picture of water quality conditions in your study streams.

Develop a Coding System

A coding system will help simplify the tracking and recording of data. Make sure, however, that the system you create is easily understood and simple to use. Codes developed for sample sites, parameters, and other information on field and lab sheets shoul d parallel the codes you use in your database. If you will be sharing your information with a state or local natural resource agency, you may want your coding system to match or complement the agency system.

Sample Sites: Because sample sites tend to change over time, it is important to have a site numbering system that accommodates change. A good convention to follow is to use a site coding system that includes an abbreviation of the waterbody and a s ite number (e.g., CtR020 for a site on the Connecticut River). For consistency, you might choose to start the site numbers at the downstream end of the stream and increase them as you move upstream (e.g., the first Connecticut River site would be CtR010, the second CtR020, etc.). Leave extra numbers between sites to allow for your program's future expansion.

Water Quality Parameters: It is also important to develop a coding system for each of the water quality parameters you are testing. These are the codes you will use in the database to identify and extract results. To keep the amount of clerical wor k to a minimum, abbreviate without losing the ability to distinguish parameters from one another. For example, EC could represent E. coli bacteria and FC fecal coliform bacteria.

Spreadsheets, Databases, and Mapping Software

Today's computer software includes a variety of spreadsheet and database packages that allow you to sort, manipulate, and perform statistical analyses on the data you have entered into the computer. For most applications, spreadsheets are adequate and hav e the advantage of being relatively simple to use. Most spreadsheet packages have graphics capabilities that will allow you to plot your data onto a graph of your choice (i.e., bar, line, or pie chart). Examples of common spreadsheet software packages are Lotus 123, Excel, and Quattro Pro.

Database software may be more difficult to master and usually lack the graphics capabilities of spreadsheet software. If you manage large amounts of data, however, a database is almost a necessity. Using a databa se, you can store and manipulate very large data sets without sacrificing speed. The database can also relate records in one file to records in another file. This allows you to break your data up into smaller, more easily managed files that can work toget her as though they were one.

If you use database software for storage and retrieval, you may still want to use a spreadsheet or other program with graphics capabilities. Many spreadsheet and database software packages are compatible and will allow you to transport sets of data back a nd forth with relative ease. Very large data sets can be organized and manipulated in a database. Specific parts of the data (such as results for a particular metric from all stations and all sampling events) can then be transported into the spreadsheet, statistically analyzed, and graphically displayed. Examples of popular database software packages are dBase, FileMaker Pro, and FoxPro.

An effective way to display your data is on a map of the stream or watershed. This clearly illustrates the relationship between land uses and the quality of water, habitat, and biological communities. This type of graphic display can be used to effectivel y show the correlation between specific activities or land uses and the impacts they have on the ecosystem. Simple personal computer-based mapping packages are available. They allow you to enter layers of data and conduct spatial analysis of that data.

Systems that allow you to map and manipulate various layers of information (such as water quality data, land use information, county boundaries, or geologic conditions) are known as Geographic Information Systems (GIS). They can vary from simple systems r un on personal computers to sophisticated and very powerful systems that run on large mainframes. For any GIS application, you need to know the coordinates of your sample sites--either their latitude and longitude, or some alternate system such as an EPA River Reach File identifier. You can also locate your sites on a topographic map that can be digitized on to an electronic map of the watershed. Once these points have been established, you can link your database to the points on the map, query your data base, and create graphic displays of the data.

Powerful GIS applications typically require expensive hardware, software, and technical training. Any volunteer program interested in GIS applications should consider working in partnership with other organizations such as universities, natural resource a gencies, or large nonprofit groups that can provide access to a GIS.

Many people are capable of writing their own programs to manipulate and display data. The disadvantage of using a "homegrown" software program, however, is that if its author leaves, so too does all knowledge about how the program works. Commercial software, on the other hand, comes with consumer services that provide over-the-phone help and instructions, user's guides, replacement guarantees, and updates as the company improves its product. Also, most commercial programs are developed to easily import and export data in standard formats. This feature is important because if you want to share data with other programs or organizations all you need are compatible software programs.

STORET

EPA's national water and biological data storage and retrieval system, STORET, is being modernized and will be available in 1998-1999. Volunteer programs are encouraged to enter their data into the modernized STORET. Individual systems will "feed" data to a centralized file server which will permit national data analyses and through which data can be shared among organizations. A specific set of quality control measures will be required for any data entered into the system to aid in data sharing. For more information, see the EPA web page at www.epa.gov/owow/STORET/.

< Previous · Table of Contents · Next >


Office of Wetlands, Oceans & Watersheds Home
Watershed Protection Home | Monitoring Water Quality Home


EPA Home | Office of Water | Search | Comments

Watershed Office of Water U.S. Environmental Protection Agency