In these diagrams, you can see that, in fact, there are four outcomes. Two of them have one tail and one head. So P( one head and one tail ) = 1/2.
If that summary doesn’t tell you all you want to know, read on…
Basics
A probability is a number. It represents the likelihood of something happening. Furthermore, it has to be between zero and one.
An event with a probability of zero is impossible; it will never happen. If the probability is small—close to zero—it’s unlikely. If it’s close to 1/2, it’s (literally in the case of a coin) a toss-up. A high probability—close to one—is very likely, and a probability of 1 represents an event that will certainly happen.
There are two ways to figure out a probability: empirically and theoretically. Let’s start empirically.
Suppose that Denise usually arrives in class first. Usually. Not always. You count for two weeks and she was first into the class seven out of ten times. The next day, will she be first? Probably, but not for certain. We can assign a number to how probable it is: 0.7, that is, 7/10.
That’s an empirical probability. It’s based on experience. You look at what happened in the past, and use that to make predictions about the future. A cool thing about probability is that you don't have to make firm, black-or-white, yes-or-no predictions: your prediction is that there’s a 70% chance that she will be first. The "70%" recognizes that you’re not certain, but it lets you be quantitative about how certain or not certain you are.
Vocabulary note: when you count up how many times something happens, the number you get is called a frequency. (This conflicts with the definition in science, where frequency usually means how many times something happens in some time period.) So in ten days, the observed frequency of Denise being first was 7.
A theoretical probability is when you have a way to know the probability from something other than experience. For example, if you roll a die, the probability that you roll a four is 1/6, or about 17%. That’s because there are six numbers on your die, and each of the numbers is equally likely.
Probability Models
Don’t be frightened by the term probability model. It’s best explained by example.
Suppose you have 28 students in your class. You put every student’s name on a card and shuffle the deck. Whoever’s name is on the top card gets the first question. What’s the chance that Emmitt is first? 1/28.
You have just used a probability model. You have assumed that each card is equally likely because you shuffled. In this case, to generalize, you have used the (common, excellent) model that if you have n things that are equally likely, the probability of each is 1/n.
Not all situations work that way, though. Suppose you put a strawberry and nine blueberries in a bag, reach in, and pull one out. What’s the chance that you pick the strawberry? One-tenth? No way. You can tell the strawberry from the blueberries by feel. Even if you didn’t care, and tried to be fair, they’re different sizes, and that will affect the likelihood of picking the strawberry. So to find the probability, you try it. If you did it 50 times (replacing the fruit each time, of course) and picked the strawberry 17 times out of 50, you would have a 34% empirical probability of picking it. You could use that in a prediction for the future.
This berry experiment also uses a probability model, this time an empirical model based on observed frequencies.