Friday, January 24, 2025
spot_img

Why is it important?

Date:

Share post:

spot_img
spot_img

Statistical Literacy

By Rajiv Gupta

Suppose you take your temperature on two successive days. On the first day it is 98.2 degrees, on the second day it is 98.8 degrees. What would you infer from these readings? More importantly, what course of action would be warranted as a result? A lot of people take notice and act on information not very different from the example cited above. We often read items in newspapers with statistics to make a point. The numbers could be about the economy, the number of accidents, or the crime rate, etc. We read the numbers but not everyone reflects on what the numbers mean. Numbers convey meaning and it is up to the reader to discern the meaning. Let us take an example.
A leading English national daily had a news item titled: “Fatal Accidents Down, Traffic Cops Cite Proactive Measures” on November 14, 2023, which gave statistics of road accidents in Delhi from 2019 to 2023 (up to October 31). There were 1433 fatal accidents in Delhi in 2019, 1163 in 2020, 1206 in 2021, 1428 in 2022, and 1141 up to October 31 in 2023. The Delhi Traffic Police cited these numbers to convey that there was a 4.7 per cent reduction in fatal accidents between 2022 and 2023 and that was due to proactive measures taken by the police. Is this a correct claim? Does it mean that in 2024 we should expect a further reduction in fatal road accidents?
Statistics can either convey relevant information which can lead to sound decisions, or they could be misinterpreted leading to action when none is warranted. In the temperature example at the beginning of the article, it should be clear that no action needs to be taken. Why? Because typical human temperatures fluctuate in a range, the range depending on the individual circumstances. If any medication is taken because the body temperature reaches 98.8, it can lead to undesirable side-effects. Similarly, we need to examine the data in the newspaper article. The observations will be valid not just for the article but for a host of numbers seen in newspapers, television, and other media.
The reason we should not start medicating ourselves when the body temperature reaches 98.8 is because body temperature has a natural range of variation. This variation is due to various factors such as the ambient temperature, our clothing, what we may have eaten, whether we have just exercised, etc. Dr. W.E. Deming, noted statistician and management guru who helped the Japanese industry become world leaders, called this natural variation caused by “common causes.” If the statistic (in this case the body temperature) is within some defined limits, no action is required. But if it goes outside these limits, then Deming said that we need to investigate and possibly take action. So once the body temperature crosses, say 100 degrees, it may be a sign that one needs to see a doctor.
Returning to the accident data, If we plot the data points, something interesting emerges.
What does the graph tell us? Should we conclude that some specific action resulted in a decrease in accidents? Or does it look like random variations in data which do not reveal any specific pattern or trend? Even without defining any limits which would indicate any unusual pattern of behaviour, it is easy to see that the number of accidents over the five-year period reported in the article is relatively stable. Not many people would interpret the reduction from the fourth to the fifth year as remarkable.
A natural question which a reader can ask is “So what?” What is the harm in misreading natural variation and taking action to compensate for any effect that we feel necessary? Is it necessary for the problem to get big before we act on it? Should we not ward of problems by taking preventive measures?
Deming wrote that if the data only shows natural or random variation, any action in response to a change from one period to the next would make the system unstable with the potential to increase the variation in the data. In the case of the body temperature example given in the beginning of this article, most healthcare professionals agree that unnecessary medication to counter random changes in the body temperature can lead to problems. We should take steps to improve one’s health such as eating well and exercise. But action should not be taken to counter a slight blip in the body temperature.
Similarly, in the case of accidents, the authorities ought to take systemic actions to reduce the overall incidence of accidents. Actions such as better patrolling, better lighting, better road maintenance, etc. should result in a reduction in accidents. But these actions should not be taken as a knee-jerk reaction to a slight change in one year’s accident data.
According to Deming, any unnecessary intervention based on short term changes in data would make the system unstable, with higher highs and lower lows. However, when the change is significant, i.e., the data crosses the limits beyond the natural variation, investigation into the cause of that occurrence is warranted and action needs to be taken. In the body temperature analogy, if the temperature were to cross 100 degrees, it often signals an infection for which some treatment becomes necessary.
Examples of misguided overreaction to data are seen in several walks of life. Perhaps one of the more glaring examples is when a company announces a drop in the sales or profit number compared to the previous quarter or year. The immediate reaction of the stock market is to selling of the stock which often leads to a drop in its price. Sales and profits of organizations do not always go up in a straight line. Any panic selling of a company’s stock in response to a poor quarter or year is unwarranted as long as the company remains healthy.
This article was written to emphasise the need to develop literacy in reading statistical data. Data is important for managing organisations, communities, and nations. However, collecting data involves time and cost. Therefore, we need to make sure that the data is put to proper use. In addition to the cost of collecting data, incorrect reaction to random fluctuations in the data can lead to worsening the system. The message should be clear. We need a better understanding of how to interpret statistical data.—INFA

 

Previous article
Next article
spot_img
spot_img

Related articles

BSF declares ‘Ops Alert’ along Indo-Bangladesh Border ahead of R-Day

Kolkata, Jan 23: The Eastern Command of the Border Security Force (BSF) has commenced 'Ops Alert' along the...

Micheal Martin confirmed as new Irish Prime Minister

London, Jan 23: Fianna Fail leader Micheal Martin has been confirmed as the new Irish Prime Minister after...

LoP Rahul Gandhi faces Bengal’s ire for mentioning Netaji’s date of death in social media post

Kolkata, Jan 23:  Leader of Opposition (LoP) in the Lok Sabha, Rahul Gandhi on Thursday faced criticism from...

South Africa expresses concern over US announcement to withdraw from Paris Agreement

Johannesburg, Jan 23: South Africa has expressed "profound regret" over the US decision to withdraw from the Paris...