DS from S: Chapter 13

Naive Bayes

Aug 06, 2024

Thought before starting

Really looking forward to getting some practice with Bayes’ Theorem. I didn’t encounter it in any of my statistics courses, which I think was an oversight in the curriculum that kind of made sense, since so much of the statistics in physics can be treated as a normal distribution kind of thing, and because it’s often easy to design most undergraduate labs in such a way that the cause of an experiment’s outcome is clear in physics.

As an aside, the causal relationship of the measurement I spent my graduate years doing was much less clear. It was a measurement of the gravitational constant, big G, and it’s the least well known of the physical constants. Currently we know the value of G to around 22 parts per million; the mass of an electron is known to about 300 parts per trillion, so 0.0003 parts per million, or about 70,000 times more precise. Some of that is caused by the experimental precision, but part of it comes from the fact that it’s easy to overestimate how good your experiment is. There are experiments with less than 22 parts per million uncertainty, but they disagree with each other by more than that. My experiment, by the way, was not one of them. We should have had a more precise experiment than that, but when we changed something that shouldn’t have mattered, our value was different. My whole thesis was basically a survey of things that weren’t the cause.

OK, on to Bayes!

Thoughts while reading

* Sorry for the asterisks instead of bullets, Substack isn’t working well on my browser at the moment (Firefox). I’ve turned off all of my extensions for this page but that doesn’t seem to be fixing the problem.

* Oh nice, we’re using the fake social network introduced in chapter 1 that we haven’t seen since. Or if we have, I no longer recall it.

What I learned

How to write a naive Bayes filter!

What I liked

Good example to work with, and I enjoyed checking out the dataset

What I disliked

Other thoughts

I’d have liked it if the author explained what choice for

Vocabulary terms

Pseudocount: modified count of a value used in a naive Bayes filter to ensure that something that is 100% indicative of whether a term is in one category or another in the test set doesn’t result in the model assigning the word a 100% predictive ability

William’s Substack

Discussion about this post

Ready for more?