Eric White
My model was a weighted ensemble of polls and fundamental data about each state. The fundamentals part of the model considered the percentage of the state population that is white, black, and over age 65, as well as the unemployment average for that state for 2020. The polls aspect of my model, which was weighted overwhlemingly more than the fundamentals, weighed the sample size, recency, and quality of the polls to create an average of polling in each state. When polling data was not available, 2016 election performance was used as a proxy, as this tends to be an effective way to predict the result, particularly in safe states that are not likely to see a massive partisan swing. Above is my map predicting which candidate would win each state. My predictions for the two-party popular vote share in each state are given below, with italicized states being ones that I expected to be within 10 points and bold states being ones that I got wrong in my prediction.
State | Biden | Trump | Margin |
---|---|---|---|
Alabama | 38.59333 | 61.40667 | -22.8133335 |
Alaska | 44.9704 | 55.0296 | -10.0592015 |
Arizona | 52.29683 | 47.70317 | 4.5936599 |
Arkansas | 32.60462 | 67.39538 | -34.7907626 |
California | 60.00608 | 39.99392 | 20.012162 |
Colorado | 66.29099 | 33.70901 | 32.581973 |
Connecticut | 65.9992 | 34.0008 | 31.9984097 |
Delaware | 54.77868 | 45.22132 | 9.5573532 |
Florida | 52.33308 | 47.66692 | 4.6661634 |
Georgia | 49.57594 | 50.42406 | -0.8481217 |
Hawaii | 69.45923 | 30.54077 | 38.9184529 |
Idaho | 27.23193 | 72.76807 | -45.5361431 |
Illinois | 52.78234 | 47.21766 | 5.5646771 |
Indiana | 46.2933 | 53.7067 | -7.4134013 |
Iowa | 47.7414 | 52.2586 | -4.5171941 |
Kansas | 47.17733 | 52.82267 | -5.6453317 |
Kentucky | 38.38104 | 61.61896 | -23.2379185 |
Louisiana | 37.9485 | 62.0515 | -24.1029913 |
Maine | 58.13816 | 41.86184 | 16.2763246 |
Maine CD-1 | 59.61712 | 40.38288 | 19.23424 |
Maine CD-2 | 51.58011 | 48.41989 | 3.160226 |
Maryland | 63.26843 | 36.73157 | 26.5368671 |
Massachusetts | 68.41095 | 31.58905 | 36.8219098 |
Michigan | 53.829 | 46.171 | 7.6580061 |
Minnesota | 50.15969 | 49.84031 | 0.3193855 |
Mississippi | 41.55102 | 58.44898 | -16.8979535 |
Missouri | 45.40799 | 54.59201 | -9.1840274 |
Montana | 37.72969 | 62.27031 | -24.5406214 |
Nebraska | 39.72492 | 60.27508 | -20.550163 |
Nebraska CD-2 | 53.03052 | 46.96948 | 6.061034 |
Nevada | 51.65607 | 48.34393 | 3.3121466 |
New Hampshire | 63.00037 | 36.99963 | 26.0007314 |
New Jersey | 60.29657 | 39.70343 | 20.5931497 |
New Mexico | 55.54129 | 44.45871 | 11.0825785 |
New York | 64.20354 | 35.79646 | 28.4070745 |
North Carolina | 48.94769 | 51.05231 | -2.1046133 |
North Dakota | 39.61115 | 60.38885 | -20.7776914 |
Ohio | 48.0239 | 51.9761 | -3.9522071 |
Oklahoma | 40.54972 | 59.45028 | -18.9005671 |
Oregon | 55.91936 | 44.08064 | 11.838716 |
Pennsylvania | 52.75727 | 47.24273 | 5.5145449 |
Rhode Island | 58.85695 | 41.14305 | 17.7139041 |
South Carolina | 46.03663 | 53.96337 | -7.9267354 |
South Dakota | 41.65608 | 58.34392 | -16.6878349 |
Tennessee | 37.11921 | 62.88079 | -25.7615867 |
Texas | 47.97994 | 52.02006 | -4.0401226 |
Utah | 35.36543 | 64.63457 | -29.2691433 |
Vermont | 68.64464 | 31.35536 | 37.2892791 |
Virginia | 55.26281 | 44.73719 | 10.525627 |
Washington | 60.19279 | 39.80721 | 20.3855821 |
West Virginia | 38.99759 | 61.00241 | -22.0048129 |
Wisconsin | 52.3016 | 47.6984 | 4.6032081 |
Wyoming | 20.71385 | 79.28615 | -58.572306 |
Ultimately, I predicted Biden would win with a 320-218 Electoral College margin, off by 14 delegates from the actual amount of 306-218. The two states that I predicted incorrectly were Florida, which I erroneously had going for Biden, and Georgia, which I predicted would go to Trump. Additionally, I predicted Maine’s 2nd Congressional District incorrectly, as this was the district where Trump most massively overperformed the polling and did so by an amount that was very difficult to account for in a model.
My model tended to overpredict support for Trump, but nevertheless, I was satisfied with the result of the model, as I ended up with a root mean square error of 4.213276 for my statewide predictions. This is relatively low and although I wish that I had performed better in certain states, the ultimate result of the model was largely accurate. One reason as to why I could have systemically underrepresented Trump’s support in my predictions is that I did not account for geography, and there is a very large and increasingly growing partisan divide among urban and rural voters, as David Hopkins argues in his book Red Fighting Blue. Instead of racial demographic factors, it may have been more worthwhile to include urban, suburban, and rural demographic breakdown factors into my model, as it is possible that this would have been more predictive. Racial demographic factors would have also been, to some extent, baked into a demographic analysis because there is a racial component to the urban-rural divide. Including these values could have helped to make the fundamentals aspect of my ensemble more relevant, as I felt that the information conveyed in that part of the model may have been extraneous.
Another trend, at least from looking at a map of my errors, is that there seems to be a clear geographic trend to these errors. In particular, I tended to favor western states in Trump’s direction and northeast states in Biden’s direction to an unnecessary degree in my model. This could be a result of the polls being incorrect or just a function of incomplete data, but I would also imagine that some of it has to do with the demographic variables that I included in my model. It is possible that these states in the northwest tend to be whiter and that my model underestimated Biden’s support from Black voters. To contrast with the Northeast, the demographics aspect of my model may have favored Democrats in these areas. It is hard to explain, at least based on that rationalization, why California favored Trump more than it should have in my model and why West Virginia favored Biden more than it should have in my model, but it is possible that these states, which both have higher than average unemployment, could be drawn to one extreme or the other. Additionally, these are highly partisan states, so in addition to there not being sufficient polling in these areas, it is also possible that the modeling process that allowed me to obtain generally accurate results in close states is not as applicable to safe states because of the partisan dynamics. For example, in California this year, Biden won a greater percentage of the vote than Clinton in 2016, yet won by a smaller margin than she did, in large part due to third party voters. The broader conclusion to be drawn from that phenomenon and the deficiencies that the model had in closer states is that there is more room for error in these partisan states, so a shift of a few points in one direction or the other might look meaningful when examining a graph like the one below, but in the grand scheme of things, when analyzing the election, these deviations are largely irrelevant.
Ultimately, of all of the possible explanations I provided, I think the last one makes the most sense. Stronghold states like California, Illinois, Idaho, West Virginia, and Wyoming were the states with the greatest errors in their prediction, which gives credence to the idea that the model struggles with highly partisan states. The trend of large errors in safe states is a much stronger trend than the geographical trend mentioned above, though that is certainly relevant as well.
Below are two graphs, the first of which shows all states and how far off my predictions were. The second graph isolates only the states that I predicted to be within 10 points with my model.
Looking first at the hypothesis that I should have included geographic variables, it would make sense to build a similar model, but instead of including a regression based on whether voters were white, black, or senior citizens, I would run a regression using urban, rural, and suburban (or I could just include the senior citizen variable in place of the suburban variable, as presumably the urban, rural, and suburban variables would not be linearly independent). I would then bring that regression into my weighted ensemble and test to see if the results were superior in this iteration of the model, not only for 2020, but also for 2016, as I retroactively predicted the 2016 election in order to see if my model was rational. If the RMSE was lower or I was able to improve the results of my model, then I might be able to conclude that this was a superior approach, or at the very least, this modeling choice would have some merits moving forward.
I would assume that the geographic and hyperpartisan-state hypotheses would be able to be tested in a similar way. Specifically, I could attempt to use my model on previous years to see if the same phenomena are seen. If it is the case that the same states tend to be pulled in one direction or the other, then there might be evidence that the model has a certain geographic bias. Looking specifically at the partisan states, I could also potentiall artificially increase the polling margin for a randomly chosen swing state to see if the prediction would remain as far off in order to see if the model truly was less effective for partisan states. In this sense, I would be keeping the demographic variables constant but changing the polling and if the discrepancy was still meaningful, I might look to change certain variables in my regression or incorporate historical data for non-swing states in order to improve the predictions in these states.
One item I wish that I had addressed in my original model was representing turnout. I had made attempts at modeling turnout in past blog posts and although I was able to predict the probability that an individual vote would go to a given candidate based on turnout, these models felt relatively arbitrarily decided and I did not feel confident in adding them to my model. Also, an assumption that I operated under was that in general, high turnout would mean better Democratic performance. This was not explicitly stated in the modeling for turnout that I had done, but the historical data pointed in that direction and there is research that indicates the same phenomenon. This was not necessarily the case this year, and although increased turnout seemed to lead to better performance for Biden, particularly in Georgia, Michigan, and Wisconsin, it would be hard to generalize this to the entire population as a whole, which is what I assumed a model for turnout would do. I am unsure how I would be able to properly model turnout, particularly in a year such as this, where one could make the argument that COVID-19 would decrease turnout, but increased vote-by-mail would increase turnout. Not only would I struggle to be able to retroactively determine how to accurately model turnout (something that also contributed to polls being off this year), I would be unsure as to how to apply that methodology to future elections that would not be impacted by COVID and may not have the same portion of the electorate voting by mail.
Additionally, I wanted to consider a way to model the relationship between states in order to create a more robust model. For example, if I know that Michigan will go to Biden, then it is reasonable to assume that Wisconsin will also go to Biden, as these states tend to have similar voting patterns. If I were to do this, I would probably consider which states voted together most often in presidential elections as well as geography and the partisan makeup of their legislatures, which may give me a clue as to how similar certain states are to each other in terms of partisanship. This could help my model become more effective at predicting certain outcomes, and the best and worst cases scenarios that I predicted might make more sense for each candidate; that being said, there could be drawbacks to this approach, as states are not monolithic and may have different underlyng trends; one might assume that Georgia and Florida would vote similarly based on these criteria outlined above, but they did not vote together in this election and I did not predict them to vote in the same way in my model either (though my model was incorrect about both states).
In the end, although I am happy that my model predicted the correct winner and many of the states, there are still a number of areas of improvement. I think it is certainly possible that my model depends too much on polls and it could be susceptible to error in that regard. I did not appropriately account for the urban-rural divide and generally struggled to predict Trump’s support across the board, as he tended to overperform my model, with the exception being some western states. My model had a root mean square error of 4.213276, and althoug this is reasonably accurate, it is still important considering one can produce an exceptionally accurate result in RMSE terms just by looking at the 2016 results while generating no other statistical inference.