Naturally occurring arsenic is poisonous above certain levels, and may be present in some groundwater sources. This is a problem in Bangladesh, where citizens get their water from local wells. Arsenic contamination varies widely: while one family’s well may be safe, a neighbor’s well may have unsafe levels of arsenic. An NGO focusing on water safety recently completed a survey of almost fifty thousand households in three regions of Bangladesh. The survey interviewed households to better understand two main phenomena. First, what factors affect well switching? Second, does arsenic cause a financial burden for families with unsafe levels of arsenic in their water? For the purposes of this study, wells with any arsenic presence should be considered as problematic for human consumption, with higher levels being increasingly toxic.
The dataset wellsdata.Rds contains information on 49274 households. These households were educated about arsenic poisoning and provided information about arsenic levels in their local wells. If the levels were unsafe, households were encouraged to switch their water source to other local options. The households were visited one year later, and data was collected on whether or not the family had switched wells. This is recorded in the variable Switch, equal to 1 if the household switched and 0 otherwise.
The presence of arsenic in a well does lead to an increase in percentage of household income spent on healthcare. When there is no arsenic in the well, the percentage of household income spent on healthcare is on average .07%. When there is arsenic in the well, the percentage of household income spent on healthcare is on average .11%.
When analyzing this effect by region, it is found that for Region 1 there is an average of 0.06% increase in percentage of household income spent on healthcare when arsenic is present. For Region 2, there is an average of 0.02% increase in percentage of household income spent on healthcare when arsenic is present. For Region 3, there is an average of .05% increase in percentage of household income spent on healthcare when arsenic is precent.
Therefore, the regional findings suggest that in term of impoverishment increased through increased spending on healthcare caused by arsenic found in wells, Region 1 has the largest difference when comparing percentage of household income spent on healthcare when there is arsenic present in the well.


Individuals were surveyed about their relationship with the well owner, with the assumption that information about the well could be spread differently depending on how close of a tie that the individual had with the well owner. People who reported that the well owner was close family on average had a 70% likelihood of switching wells. People who reported that the well owner was a neighbor on average had a 50% likelihood of switching wells. Lastly, though not significant and therefore uncertain due to a wide confidence interval, people who reported that the well owner was in the same bari had on average a 70% likelihood of switching wells.

If people know that they the status of the well is unsafe, on average, they have an 86% proability of switching. If people don’t know the status of the well, they also have an 86% percent chance of switching on average. However, the people who don’t know the status of the well also have the most uncertain outcomes. The confidence interval ranges from a 68% cchance of swtiching to a 95% chance of switching, and the results are not significant. This indicates that people who don’t know the status of a well are a vulnerable group to the negative effects of wells that are contaminated by arsenic.

(Dropped responses that were not coded in codebook, 1, 2, and 3 included only)
Individuals found out about the status of the well in various ways. When surveyed about how they found out about the status of the well, people who reported that they knew about the status of the well through a CU Result were on average 86% likely to switch. People who said they found out about the status of the well through the paint were on average 58% likely to switch. People who found out about the status of the well through other means were on average 60% likely to switch, but the results of people who found out about the status of the well through other means are not significant and therefore uncertain. These results indicate that a CU Card is again quite an important determinanat for well switching in households with arsenic in their wells.

People who have a CU Medical card are most likely to switch wells when a well is found to be contaminated with arsenic. On average, people who have a CU Medical card are 72% likely to switch. People who do not have a CU Medical card are only about 51% likely to switch. However, this result is not significant because it is more uncertain, the confidence interval of switching ranges from 40% to 61%.

The probability of switching based on status which was visible on the well was the highest when there was a CU plate present, with an average of 95% switching based on wells that had a CU plate present. Comparatively, red paint (which is inferred to indicate a well that is not safe), only had an average of 80% switching and was not a significant result.

Meanwhile, the new plate indicators on the well seemed to successfully predict switching behavior. When the plate indicated that the well was unsafe, there was an 85% probability of switching on average. When the plate indicated that the well was safe, there was only a 17% probability of switching.

One of the ways to increase switching behavior is also creating a clear and widely understood method for determining which wells are safe to use and which are not. From the codebook and the results of the analyses on switching behavior and indicators of status of the well (Newstat and Oldstat), it seems that there are certain indicators that allow for a greater probability of switching than others.
A clear indicator of switching behavior is also strongly linked to how information is spread, and what information people have. People who did not know the status of the well had a lot of uncertainty about whether they would switch or not compared to people who did know that the well was unsafe. Combined with a more clear and systemitized way of indicating which wells are safe and which are not safe, a way of educating key people in the village about the safety of wells is crucial for increasing switching behavior.
By key people, I mean well owners and leaders in villages. Households tended to switch more often if they found out about the status of the well through close family or the same bari. If well owners were more educated on how to disemminate that information, it would spread not just to their family and friends, but also to others who used their well. They could also be educated on best practices for testing, and then placing clear markers on the well to help people know that the well was unsafe.
Finally, in my analyses I find that having a CU Medical Card greatly increases switching behavior. If possible, NGO and policy makers should focus on creating accessible medical services that can be coupled with education initiatives to ensure that people have access to clean and healthy water and are making informed decisions about the wells they get water to drink and cook with.
Click here to see code: https://github.com/lama-ahmad/DataAnalysis-R/blob/master/arsenicinwells.Rmd
| Dependent variable: | |
| pct_hhinc_healthcare | |
| anyArs | 0.039*** |
| (0.001) | |
| Constant | 0.074*** |
| (0.0001) | |
| Observations | 49,274 |
| R2 | 0.055 |
| Adjusted R2 | 0.055 |
| Residual Std. Error | 0.015 (df = 49272) |
| F Statistic | 2,865.240*** (df = 1; 49272) |
| Note: | p<0.1; p<0.05; p<0.01 |
| Dependent variable: | |
| pct_hhinc_healthcare | |
| anyArs | 0.062*** |
| (0.001) | |
| RegionRegion 2 | -0.001*** |
| (0.0002) | |
| RegionRegion 3 | -0.009*** |
| (0.0002) | |
| anyArsTRUE:RegionRegion 2 | -0.025*** |
| (0.002) | |
| anyArsTRUE:RegionRegion 3 | -0.056*** |
| (0.002) | |
| Constant | 0.077*** |
| (0.0001) | |
| Observations | 49,274 |
| R2 | 0.138 |
| Adjusted R2 | 0.138 |
| Residual Std. Error | 0.014 (df = 49268) |
| F Statistic | 1,579.500*** (df = 5; 49268) |
| Note: | p<0.1; p<0.05; p<0.01 |
In part 2, I subsetted the data to only include responses in which anyArs was equal to True. I made this decision because we are only interested in results that explain the switching behavior of people who did have arsenic in their wells.
In all questions, if there were results that seemed to be miscoded or that I could not infer what they meant from the code book, I did not include them in my predictions for Switching and only included variables that I could be sure of what the response meant based on the code book.
| Dependent variable: | |
| Switch | |
| Rel2 | -0.020 |
| (0.331) | |
| Rel3 | -0.868** |
| (0.374) | |
| Constant | 0.868*** |
| (0.121) | |
| Observations | 410 |
| Log Likelihood | -251.887 |
| Akaike Inf. Crit. | 509.774 |
| Note: | p<0.1; p<0.05; p<0.01 |
| Dependent variable: | |
| Switch | |
| Status1 | -3.498*** |
| (0.319) | |
| Status2 | 0.006 |
| (0.565) | |
| Constant | 1.866*** |
| (0.177) | |
| Observations | 410 |
| Log Likelihood | -166.850 |
| Akaike Inf. Crit. | 339.700 |
| Note: | p<0.1; p<0.05; p<0.01 |
| Dependent variable: | |
| Switch | |
| Howknow2 | -1.508*** |
| (0.349) | |
| Howknow3 | -1.443 |
| (0.949) | |
| Constant | 1.849*** |
| (0.261) | |
| Observations | 207 |
| Log Likelihood | -105.339 |
| Akaike Inf. Crit. | 216.678 |
| Note: | p<0.1; p<0.05; p<0.01 |
| Dependent variable: | |
| Switch | |
| Cucard1 | 0.914*** |
| (0.247) | |
| Cucard2 | 16.522 |
| (581.975) | |
| Constant | 0.044 |
| (0.211) | |
| Observations | 410 |
| Log Likelihood | -241.227 |
| Akaike Inf. Crit. | 488.453 |
| Note: | p<0.1; p<0.05; p<0.01 |
| Dependent variable: | |
| Switch | |
| Oldstat1 | 1.805*** |
| (0.658) | |
| Oldstat2 | 0.296 |
| (0.406) | |
| Oldstat3 | -2.439*** |
| (0.409) | |
| Constant | 1.139*** |
| (0.287) | |
| Observations | 274 |
| Log Likelihood | -123.021 |
| Akaike Inf. Crit. | 254.042 |
| Note: | p<0.1; p<0.05; p<0.01 |
| Dependent variable: | |
| Switch | |
| Oldstat1 | 1.805*** |
| (0.658) | |
| Oldstat2 | 0.296 |
| (0.406) | |
| Oldstat3 | -2.439*** |
| (0.409) | |
| Constant | 1.139*** |
| (0.287) | |
| Observations | 274 |
| Log Likelihood | -123.021 |
| Akaike Inf. Crit. | 254.042 |
| Note: | p<0.1; p<0.05; p<0.01 |