Related Products:

Society for Amateur Scientists

 

 

 

 

 

Sponsored by:

Design Your Experiments

by Kevin Kilty

Sheldon Greaves and Norman Stanley have discussed a series of articles for the SAS Bulletin regarding the design of experiments, or what some people call DoE. This is a good idea. Not only is there paltry literature available for amateurs regarding the subject--the extensive literature is tailored to very specific audiences and is verbose beyond reason--but also, the point of view changes quite a lot from one audience to another. Since Norman is busy with his excellent series on chemistry, I have decided to get things underway. However, I hope this eventually involves other people as well, even if this means only through their feedback on the topic de jure. Please feel free to reach me directly.

Let me take this first note in the series to outline a few issues. Then, over the course of many weeks, I'll tackle topics one at a time. What I hope to do is eventually equip an experimenter's toolkit. Some of the tools are mathematical, some are procedural, and some are philosophical.

What is experimental design?

Engineers, biologists, and physicists each have a slightly different take on experiments. I am in a good position to see this because my formal understanding of DoE comes from engineering, while my formal education is as a geophysicist. Engineering experiments often have a different motivation than scientific experiments. Engineers typically have an idea in working form; that is, they have a process or product that works, but they need to figure out what factors influence the process or product so they can make improvements. They design experiments to screen many factors and find the most important ones. Engineers also use experiments to make robust designs. When products or processes become available for use, they have many random influences which the engineer has no control over. The idea of a robust design is to make a product or process work well even in circumstances that the engineer could hardly have forseen in advance.

In chemistry and biology there are circumstances in which the experimenter would like to screen factors to decide which are the most important. In these situations the engineering model of DoE is useful. However, there are scientific issues where the goal is not to find out what factors most influence a process, but, rather, to decide between or among competing theories. The engineering model of experimental design is not so useful, here, and what is required is a much more philosphical perspective on experimental design. To take a logical positivist approach for a moment, the first step in experimental design is to decide which measurements separate one theory from another and allow us to reject theories that fail. Good theories make predictions and the essense of an experiment is to decide exactly which predictions to test. For example, quantum theory predicts that specific heat of an insulator crystal should vary as temperature to the 3rd power at low temperature. Testing the adequacy of this theory means making measurements of heat capacity as we vary the temperature, and DoE in this instance is not concerned with screening factors, but in how to make measurements with the resolution needed to disprove the theory.

What do we need to begin designing an experiment?

Experiments work best if we design them in the light of everything we know about something. Learn all you can about a subject before embarking on experiments. It is perfectly alright to be motivated to study a subject through accidental, serendipitous observations of something unusual. This happens to me a lot. But it makes little sense to begin doing experiments in a state of total ignorance, especially when you can remove that ignorance easily through library research or talking with someone who has an interest in the topic. This is what makes the SAS forum such a valuable resource. The Forum lets you find people with particular interests. In fact, the more prepared you are about a subject, the more likely you are to notice unusual little things that lead to genuine discoveries. As Louis Pasteur said, "Chance favors the prepared mind."

Please do not interpret what I just said as meaning that there is no reason to repeat what other people have done. Many mistakes in science are discovered through repetition of a single experiment, and it pays to convince yourself that you can repeat other peoples' results before launching in some new direction.

Recently I read a fun little book entitled "How to get ideas." It is written by an advertising man, and he documented how it is that many creative people--advertising people included-- get ideas by becoming thoroughly immersed in a subject. This is an initial step in most research. Never overlook its importance.

Observation versus experiment

Some sciences, like chemistry, physics, and biology, lend themselves to carefully controlled experiments. Others, in contrast, have to make do with observations of the experiments that nature provides. This does not mean that design of experiments doesn't apply, but that the issues of design now focus on topics such as types of equipment to use, where to make observations, how often, and deciding on the needed resolution. These are not topics of formal DoE, per se, but I'll examine them in detail. One other interesting issue to examine is "When do simulations act as legitimate experiments?"

Resolution and propagation of error

Rarely is it possible for someone to take a direct reading of some parameter. Generally we have to transform a reading into a derived value that is of direct interest. This means that all sorts of factors, such as corrections, instrumental constants, and calibration are involved in the experiemnt. Each of these factors provides an avenue by which errors propagate into a result. The formal analysis of error propagation not only helps quantify the uncertainty in measurements, it also helps identify those factors which contribute most to uncertainty and helps us design better experiments as a result.

Inference and Inversion

When we obtain data from an experiment or observation, what shall we do with it? In some instances the result of the experiment is of interest itself. In such a case I would likely just report the result along with some assessment of its incertainty, and then summarize what I could infer from this. Statistical tools of various sorts are helpful, here. In other cases I will wish to obtain values for the parameters of some model from my observations, and what is called for is regression, or some other form of inverting the observed values into model parameters. This was a big topic of research in geophysics 25 years ago. In the meantime the idea of inversion has diffused throughout the sciences and engineering.

This brings my outline of issues full circle. Regression is probably the most direct means by which to interpret results of DoE in the formal engineering sense of the term. What engineers call a response surface, a scientist or statistician recognizes as a regression model. Every amateur scientist can benefit from an understanding of regression, and how to perform it with a computer spreadsheet application. In fact, some concepts in models and model building may be a good topic to begin with next time.

Reprinted from: