/r/probabilitytheory
For everything probability theory related! Whether you're a high schooler or a stochastic wizard, if you want to learn something about this subject you're most welcome here.
This subreddit is meant for probability theory-related topics. Regardless of your level of education or familiarity with probability theory, if you would like to learn more or just talk about this fascinating subject, you are welcome.
Please keep topics of discussion related to probability theory. Interdisciplinary topics are fine, but if the focus is a different area, the post may be removed as off-topic. Titles should be a short summary; use the text box for the content of your question or topic.
Remain civil/polite in your conversations. If you think somebody is mistaken, focus on the content, not the user. Treat them as you might treat a student. Insults or snide remarks may result in removed comments or locked comment chains.
Post Flair: Please use one of the following flairs for your post. Links will filter to that flair.
Filter out homework posts: Anti-homework
Regarding homework: When asking a homework question, please be sure to: (1) Clearly state the problem; (2) Describe/show what you have tried so far; (3) Describe where you are getting stuck or confused.
To see what probability theory is, see the probability theory wiki. To understand the difference between statistics and probability theory, see this discussion.
Unfortunately, there is no universal LaTeX plugin for Reddit, so we have to make do with normal fonts.
Hey there, I thought this would be a simple problem, but it turns out it's way more complex than I thought. Does someone know how to solve it, or have any suggestions?
I have four bags, each containing four balls. The first bag has one blue ball and three red balls. The second bag has two blue balls and two red balls. The third bag has one blue ball and three red balls. The fourth bag has three blue balls and one red ball. Each time I take a ball out of a bag, I do NOT put it back (drawing without replacement). I want to remove all the blue balls from the bags. How many draws do I need to have an 80% chance of having removed all the blue balls? Please show the calculations.
Thanks in advance.
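The post doesn't specify how a bag is chosen on each draw. Under the simplest reading, where each draw takes a uniformly random ball from everything still in play, the four bags behave as a single urn with 7 blue and 9 red balls, and the answer has a closed form. A minimal sketch under that assumption:

```python
from math import comb

# Assumption (not stated in the post): each draw takes a uniformly random
# ball from all remaining balls, so the four bags act as one urn with
# 7 blue and 9 red out of 16.
BLUE, TOTAL = 7, 16

def p_all_blue_drawn(n):
    # P(the first n draws contain all 7 blue balls) = C(9, n-7) / C(16, n)
    if n < BLUE:
        return 0.0
    return comb(TOTAL - BLUE, n - BLUE) / comb(TOTAL, n)

for n in range(BLUE, TOTAL + 1):
    print(n, round(p_all_blue_drawn(n), 4))
```

Under this model, even 15 draws only give 9/16 = 56.25%, so all 16 draws are needed to clear 80%. A strategy that chooses bags deliberately (e.g. emptying the blue-heavy bag first) changes the numbers, so the intended drawing rule matters.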
In this problem, I don't understand the distinction between (a) and (b). Are they different? If so, how?
Can someone help?
tl;dr: I would love to confirm my simulation algorithm for a card game by mathematically calculating the win probability. Unfortunately, the game is really complex; are you up for the challenge? I also provide the results of my simulation at the end.
Hi guys,
I am currently writing an app that counts cards for a card game. I think it is internationally known as fck u, or in Germany as Busfahrer. As a programmer, I wrote a simulation for winning the game, but I have no idea whether my results are right/realistic, because there is no way I can play enough games to reach statistical significance. So the obvious approach would be to calculate the chance of winning. Sadly, I seem to suck at probability theory, so if you want a challenge, be my guest. I will also share my simulation results further down.
Rules:
Because there are probably many different sets of rules, here are mine:
Stages in detail:
Probability Calculation:
I am well aware of how to calculate the individual probabilities for a full deck and specific cards. It gets tricky once you track the current stage and the already-drawn cards. As far as I can see, there are three possible ways to make decisions:
1. Always picking the best option with no knowledge of the cards drawn in previous stages and no long-term card counting (playing blind).
2. Choosing based on the cards of previous stages, e.g. knowing the first card when predicting higher/lower (a normal smart player who doesn't count cards).
3. Choosing based on perfect knowledge: knowing all cards that have been drawn, the ones remaining in the deck, and the ones from previous stages (that would be my app).
What I want to know:
I am interested in knowing the probability of winning the game before running out of cards. An additional thing would be knowing the probability of winning with a certain number of cards left, but this is not a must-have.
the chance y of winning after exactly x draws
the chance y of winning within the first x draws
My simulations:
Basically, I run the game for 10,000,000 decks and record the cards remaining in case of a win, or the fact that it was a loss. I can run my simulation for any remaining-card combination, but to keep it simple, just assume a complete starting deck. My result is that you have an 84% chance of winning before you run out of cards. Note that this assumes perfect decision making with knowledge of all drawn cards. I have no idea if that is even near the real number, because even one < instead of a > in my code could fuck up the numbers. I also added 2 graphs that show when my algorithm wins (above).
For choices without card counting I get a 67% chance of winning, and for trivial/blind choices (always red, higher, between, hearts, new) I get 31%.
Let me know if you want to know anything else or need other data analysis.
Thank you so much for your help. I would love to see how something like this can be calculated <3
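One thing that can be said without auditing the game logic: with 10,000,000 simulated decks, the sampling noise on an 84% estimate is tiny. The standard error is sqrt(0.84 * 0.16 / 10,000,000) ≈ 0.00012, so a 95% confidence interval is roughly 84% ± 0.02 percentage points. In other words, if the simulation disagrees with an exact calculation by more than a hair, the discrepancy is a modeling or coding issue, not random noise.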
Here is a question that is beyond my mathematical competence to answer. Can anyone out there answer it for me?
Suppose you have a deck of 93 cards. Suppose further that three of those 93 cards are special cards. You shuffle the deck many times to randomize the cards.
Within the shuffled deck, what is the probability that at least one special card is located within four cards of another special card? (Put alternatively: what is the probability that the deck contains at least one run of four adjacent cards holding at least two special cards?)
(That's an obscure question, to be sure. If you're curious why I'm asking, this question arises from the game of Flip 7. That game has a deck of 93 cards. One type of special card in that game is the "Flip 3" card. There are three of these cards in the deck. If you draw a Flip 3 card on your turn, then you give this card to another player or to yourself. Whoever receives the Flip 3 card must then draw three cards. I'm trying to estimate the likelihood of "chained" Flip 3 cards occurring. That is, I'm trying to estimate the odds of the following case: after drawing a Flip 3 card, you draw a second Flip 3 card as part of the trio of drawn-cards that the first Flip 3 card triggers.)
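It helps to separate two slightly different readings: "within four cards" (positions differing by at most 4) and "some window of four adjacent cards holds two specials" (positions differing by at most 3). Both have a clean closed form: the number of ways to place k special cards among n positions with every pair more than d apart is C(n - (k-1)d, k). A sketch:

```python
from math import comb

# P(some pair of the k special cards sits within distance d in a shuffled
# n-card deck) = 1 - C(n - (k-1)*d, k) / C(n, k).
def p_some_pair_within(n, k, d):
    return 1 - comb(n - (k - 1) * d, k) / comb(n, k)

print(p_some_pair_within(93, 3, 4))  # gap <= 4: ~0.239
print(p_some_pair_within(93, 3, 3))  # two specials in a 4-card window: ~0.183
```

So roughly a 20% chance per shuffled deck under either reading.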
So imagine there is a random chance of rolling blue (1/10 chance) or red (9/10 chance). What is the probability that you roll blue before red? Assume that every roll has the same odds.
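Assuming each roll independently shows blue with probability 1/10 and red with probability 9/10 (and possibly neither, if other outcomes exist), the standard race argument gives

P(blue before red) = P(blue) / (P(blue) + P(red)) = (1/10) / (1/10 + 9/10) = 1/10.

If every roll is either blue or red, this is immediate: the very first roll decides the race.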
So I'm trying to figure out the probability that the maximum of the coordinates of an n-dimensional centroid is less than some number, and what happens as the dimension tends to infinity. The vertices are uniformly distributed on [0,1].
For the 3D case: we are calculating P(max(C) <= N) where C = ((x1+x2+x3+x4)/4, (y1+y2+y3+y4)/4, (z1+z2+z3+z4)/4) are the coordinates for the centroid:
Since z = (x1+x2+x3+x4)/4 ~ U(0,1), our problem is equivalent to calculating the probability that the maximum of 3 uniform variables is less than N, since 3 coordinates define the centroid in 3 dimensions. This should be the probability that the cube root of one of the variables is less than N, which results in N^3 as shown below:
P(max(C) <= N) = P(z^(1/3) <= N) = N^3
I believe this is correct.
How would you evaluate the limit of P(max(C^n) <= N) as n tends to infinity for the n-dimensional centroid? If the exponent of N grows in the n-dimensional case, and N is between 0 and 1, the probability for the maximum of the centroid would converge to 0...? How does this make sense? If we include more coordinates, we would expect this probability of the maximum to approach 1, wouldn't we?
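A quick numpy sketch can sanity-check all of this, under the setup as described (n+1 vertices uniform in [0,1]^n, centroid = coordinate-wise mean):

```python
import numpy as np

rng = np.random.default_rng(0)

# Estimate P(max coordinate of the centroid <= N) for an n-dimensional
# simplex with n+1 vertices drawn uniformly from [0,1]^n.
def p_max_centroid_leq(n, N, trials=10_000):
    acc = np.zeros((trials, n))
    for _ in range(n + 1):            # add one uniform vertex at a time
        acc += rng.random((trials, n))
    centroids = acc / (n + 1)
    return float((centroids.max(axis=1) <= N).mean())

for n in (3, 10, 50):
    print(n, p_max_centroid_leq(n, 0.6))
```

One caveat the simulation will expose: the mean of n+1 independent uniforms is not itself U(0,1); it follows a scaled Irwin-Hall distribution that concentrates around 1/2 as n grows. Because every coordinate concentrates at 1/2, P(max(C) <= N) actually tends to 1 for N > 1/2 and to 0 for N < 1/2, which dissolves the apparent paradox.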
Hey there!
A couple of friends and I are trying to build a calculator for an event in a game, but we're having trouble with a specific scenario, and I'm hoping someone smart in here has an answer.
Scenario simplified here:
Every time we click a button, there is a 5% chance of being given a cookie, but every 10th pull, we are guaranteed to be given a cookie no matter what.
Now, I've arrived at an "average" probability of being given a cookie over n attempts of 14.5%, but my friends doubted it, and now I'm also not sure. It would be awesome if someone could explain how to actually do this.
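If the guarantee is a fixed cycle (pulls 1-9 at 5%, pull 10 automatic, then the count restarts), the long-run average number of cookies per pull is

(9 * 0.05 + 1) / 10 = 1.45 / 10 = 0.145,

so the 14.5% figure checks out under that model. If instead the counter resets whenever a cookie drops (a true pity timer), the renewal rate is one cookie per E[pulls per cookie] = (1 - 0.95^10) / 0.05 ≈ 8.03 pulls, i.e. about 12.5%, so it's worth checking which rule the game actually uses.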
I was trying to calculate the probability of this situation:
Let's say I'm at a poker table with 4 other players. I'm the first one to act, and I would like to know how often at least one of the other players will call my bet and play with me.
Let's assume that each player only plays 20% of their hands (so 20% of the time they will call me).
Would the formula for this probability be 1 - (0.8^4)? So a total of about 60% of the time? Is that correct?
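Assuming the four opponents act independently, the formula is right; the exact value is 1 - 0.8^4 = 1 - 0.4096 = 0.5904, so about 59% of the time (a touch under the rounded 60%). In live play, opponents' decisions correlate with position and bet size, so the independent 20% calls are the idealized part.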
Hello!
First of all: I'm not really looking for a straight-up answer, more of a nudge in the right direction.
The question: a couple of days ago I got a denary dice set (from d1 to d10). Now I want to make some sort of graph of the probability distribution of the total. I've made a simulation in Sheets, as well as a NORMDIST based on the E(x) and stdev(x) of the set. The problem is that neither seems to perfectly represent reality, since there is always a (very tiny) modeled chance of rolling below 10 (ten ones) or above 55 (1+2+...+10).
In short: how do I account for the physically impossible outcomes, since using mean +/- 3*stdev covers about 99.95%, without having to "brute force" everything one by one?
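The clean fix is to drop the normal approximation entirely and compute the exact distribution of the sum, which for independent dice is just a convolution, folding in one die at a time. A sketch:

```python
from collections import defaultdict

# Exact distribution of the sum of a d1, d2, ..., d10 by convolution.
dist = {0: 1.0}
for sides in range(1, 11):
    new = defaultdict(float)
    for total, p in dist.items():
        for face in range(1, sides + 1):
            new[total + face] += p / sides
    dist = dict(new)

print(min(dist), max(dist))   # 10 55: nothing outside the possible range
print(sum(dist.values()))     # 1.0 up to float rounding
```

The exact distribution puts probability exactly 0 outside 10..55, so there is nothing to "account for": the stray tail mass was purely an artifact of NORMDIST.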
For a game where there are five cards: one of the cards is a king and the other four are aces. If you pick the king you win, and if you pick an ace you lose. The cards are shuffled and laid out in spots one through five. You pick a spot (one through five) and draw that card.
Obviously the odds of winning this game are 1/5 (20%). However, if you play the game multiple times, does the picking strategy matter?
I think intuitively, if you pick the same spot every time (i.e. always picking spot 3), it's purely down to the random shuffling, and therefore the odds of winning are still 1/5 (20%). I was told, however, that if you pick a "random" spot every time (i.e. just going with your gut), the odds are no longer 1/5 (20%).
This feels incorrect; it seems like the odds should be 1/5 no matter what my picking strategy is. That said, it also feels like the picking pattern could introduce more variance, but I'm not sure.
However, I don't know the math behind it. This is what intuitively feels correct to me, but it isn't based on the actual probability. I'm hoping someone can explain the math/stats behind it.
This is my first post here, so let me know if I did anything wrong or need to include more information. (I feel like the title is bad/inaccurate, so if someone has a more accurate way to phrase the question, let me know.)
Also, for what it's worth this is related to the new Pokemon TCG Pocket wonder picks.
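The intuition is right: the shuffle is uniform and independent of how the spot is picked, so any strategy that doesn't use information about the cards wins exactly 1/5, with the same variance in the long run. A quick sketch for convincing doubters:

```python
import random

# Fixed spot vs. a fresh random spot each game; 5 spots, one winning king.
def win_rate(strategy, games=200_000):
    wins = 0
    for _ in range(games):
        king = random.randrange(5)                    # king's spot after shuffle
        pick = 2 if strategy == "fixed" else random.randrange(5)
        wins += (pick == king)
    return wins / games

print(win_rate("fixed"))    # ~0.20
print(win_rate("random"))   # ~0.20
```

The "random picks change the odds" claim would only hold if the shuffle were biased or the picker had information about it.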
First off, I understand that this can also be modeled as a dice game (changing from a d10 to a d12 to a d14), but the real-life context of the question I'm trying to solve is based around placements/rankings.
Let's say that me and 9 other people are playing a marble race game. Assume that it is a fair game and all marbles have an equal chance at winning. My marble does very poorly. We repeat the race a few more times. My marble does poorly every time.
Two other people see our game and want to join, so they add their marbles to the race. Still, my marble places poorly. Again, two more people join in the game, and my marble continues to disappoint.
See here for a hypothetical table of results:
| My marble placed... | ...out of N marbles |
|---|---|
| 9 | 10 |
| 9 | 10 |
| 8 | 12 |
| 9 | 12 |
| 12 | 12 |
| 7 | 12 |
| 7 | 14 |
| 13 | 14 |
How can I calculate the probability that my marble performed as poorly as it did, while accounting for the fact that the odds of victory grew slimmer as marbles were added to the race?
Ultimately, the question I would like to solve is - What is the probability that a marble performs as badly as my marble did over the course of these 8 races?
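One reasonable way to formalize "performed as poorly as it did" is to choose a badness statistic, for example the sum of normalized placements (place divided by field size), and then ask how often 8 fair races score at least that badly. The statistic is a modeling choice, not canonical; a Monte Carlo sketch:

```python
import random

# Races as (placement, field size) from the table above.
races = [(9, 10), (9, 10), (8, 12), (9, 12), (12, 12), (7, 12), (7, 14), (13, 14)]
observed = sum(place / n for place, n in races)

trials, at_least_as_bad = 200_000, 0
for _ in range(trials):
    stat = sum(random.randint(1, n) / n for _, n in races)
    at_least_as_bad += (stat >= observed)

print(at_least_as_bad / trials)   # p-value-style estimate under fair races
```

Normalizing by field size is what accounts for the growing number of marbles: placing 13th of 14 counts as slightly worse than 9th of 10.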
I have 7 marbles. 3 of them are red, 2 are blue, and 2 are red-blue (they count as either colour, but only once). I draw 3 marbles. No replacement, order doesn't matter.
What is the probability of me drawing a collection of marbles that covers all colours?
Success states:
It doesn't matter if the computation is logically complicated; I just need to understand the main principles. I have programming skills, so once I get some sort of logic down I can do basically whatever. I don't want to do a Monte Carlo simulation; I'd rather stick to pure probability theory.
The real application is in a card game (Magic: the Gathering) where I can have cards that fall into multiple categories. My goal is to draw a collection of cards that covers all colours. There are 5 colours - white, blue, black, red and green. There are cards that are a single colour, two colours, or three colours, etc... The limitation is that if I draw a white-blue-red card it should only count towards one of those colours at a time, not all of its colours.
A simulation would be easier, but since I'm making an online tool, I think iterating many times either (A) will produce inaccurate results or (B) is computationally more intensive than a straightforward calculation.
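Exact enumeration avoids simulation entirely: represent each marble as the set of colours it can count for, enumerate all C(7,3) = 35 draws, and for each draw check whether distinct marbles can be assigned to cover every colour. That check is a small matching problem; brute force is fine at this scale and extends to MTG's five colours for reasonable hand sizes. A sketch:

```python
from itertools import combinations, permutations

# Marbles as sets of the colours they can count for (each marble used once).
marbles = [{"R"}, {"R"}, {"R"}, {"B"}, {"B"}, {"R", "B"}, {"R", "B"}]
colours = ["B", "R"]

def covers(draw):
    # Brute-force matching: try every assignment of distinct marbles
    # to the colours; fine for small hands and few colours.
    return any(
        all(c in m for c, m in zip(colours, perm))
        for perm in permutations(draw, len(colours))
    )

hands = list(combinations(marbles, 3))
print(sum(covers(h) for h in hands) / len(hands))   # 34/35 ~ 0.9714
```

Here the only failing draw is the three mono-red marbles, giving 34/35. For five colours and larger hands, swapping the brute-force check for a proper bipartite matching algorithm keeps the tool fast.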
I want to prove that probability-zero elements are impossible in a finite sample space.
Proof:
In a finite sample space S = {a, b}, the equally likely case gives P(a) = P(b) = 1/2. In the non-equally-likely case, {a} and {b} occupy different "proportions" of the sample space. Now split the sample space into parts such that {a} contains j of those "splits" and {b} contains k of them, in such a way that all j+k splits are again equally likely. Solving, we get P(a) = j/(j+k), and if j = 0, it implies that {a} gets zero "splits", i.e. it is impossible! Meaning it will never happen!
So here's the conundrum in the simplest of terms. In a 4-way rock-paper-scissors match, three of us draw paper and one draws scissors, so he wins that match. In the next, 3-way match, two of us get scissors and one gets rock, and he wins. In the final one-on-one match, he gets scissors, I draw rock, and I win the match. What are the odds of this happening?
(I tried to do the math myself and I got 0.009602%, and I don't think I'm right.)
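Assuming everyone throws uniformly at random and independently, and requiring a specific winner only in the final round:

Round 1: exactly one scissors and three papers among 4 players: C(4,1) * (1/3)^4 = 4/81.
Round 2: exactly one rock and two scissors among 3 players: C(3,1) * (1/3)^3 = 1/9.
Round 3: he throws scissors and you throw rock: (1/3)^2 = 1/9.

Multiplying: (4/81) * (1/9) * (1/9) = 4/6561 ≈ 0.061%. If instead the winner of each round has to be one particular person, drop the C(4,1) and C(3,1) factors, which gives 1/19683 ≈ 0.0051%. Which number is "right" depends on exactly which outcomes count as the event.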
SHORT VERSION:
In a simulation where virtual gamblers bet on the outcome of a d3 die that yields 2, 3, or 7, it seems that betting on 3 (instead of the expected value, 4) minimizes losses. Gamblers lose an amount equal to their error.
Results: https://imgur.com/a/gFsgeBZ
LONGER: I realize I still struggle with what expected value is. I know that it's not actually the value to expect (e.g. a d6 die will never yield 3.5) and is more like an average (mean) of all outcomes.
But I was sure EV is what you bet on, especially when many trials are available.
I simulated many bets on a d3 die that yields either 2, 3, or 7. The EV of that die is 4. Gamblers guess the die roll and lose an amount of money equal to their error. For example:
guessed=4 actual=2 loses 2
guessed=4 actual=3 loses 1
guessed=4 actual=7 loses 3
Betting on 3 (not the EV of 4!) seems to minimize losses, which is a surprise to me. Why isn't the EV the optimal bet?
Even stripping the probability view away, shouldn't the mean (which I considered the value fairest in distance to the input values) be the obvious spot to aim for when minimizing error?
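This is a known fact rather than a bug: the mean minimizes expected squared error, while expected absolute error, which is exactly the loss in this game, is minimized by the median. For outcomes {2, 3, 7} the median is 3. A tiny check of every guess:

```python
# Expected loss E|X - g| for each guess g, die outcomes {2, 3, 7}.
outcomes = [2, 3, 7]
for g in range(2, 8):
    loss = sum(abs(x - g) for x in outcomes) / len(outcomes)
    print(g, round(loss, 3))
# g=3 gives 5/3 ~ 1.667, beating g=4 (the EV), which gives 2.0
```

If the gamblers instead lost the squared error, betting the EV of 4 would be optimal; "what should I bet?" always depends on the loss function.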
In a finite sample space, if A is a proper subset of the sample space (A ⊂ S, where S is the sample space), can the probability of A be equal to 1?
Hi, I am a noob and want to ask: is the frequency interpretation of probability justified and proved by the theorem known as the Law of Large Numbers?
Hello everyone,
I am working on a dashboard and would like to develop a key figure that indicates the risk that a machine will not be delivered on time. The individual risks of the required materials are already known. Now I am faced with the challenge of aggregating these material risks at the machine level in a meaningful way. Furthermore, the number of materials for each product can differ.
What I think is important here is that:
Also important: the risks are independent from each other, since this is more or less a prototype to get a feel for the numbers.
Do any of you have experience with aggregating risks in such cases, or ideas on how best to implement this mathematically? Recommendations for books covering this kind of situation are welcome.
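For what it's worth, one common aggregation under the independence assumption is to say the machine is late if at least one of its materials is late, so the machine-level risk is 1 minus the product of the on-time probabilities. A minimal sketch (function name and example numbers are mine):

```python
# Machine is late if at least one required material is late,
# assuming independent material risks.
def machine_risk(material_risks):
    p_all_on_time = 1.0
    for p_late in material_risks:
        p_all_on_time *= (1.0 - p_late)
    return 1.0 - p_all_on_time

print(machine_risk([0.05, 0.10, 0.02]))   # ~0.162
```

This naturally handles a different number of materials per machine, but note it treats every material as equally blocking; weighting by schedule slack or criticality would need a richer model.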
How would I find the probability of drawing 3 or fewer of the same card type in a 7-card hand using the hypergeometric formula? Population size is 100, sample size is 7, population of successes is 40.
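One reading of the question: what is the chance that at most 3 of the 7 drawn cards come from the 40 "success" cards? Summing the hypergeometric terms for k = 0..3:

```python
from math import comb

# P(X <= 3), X hypergeometric: population 100, 40 successes, sample of 7.
N, K, n = 100, 40, 7
p = sum(comb(K, k) * comb(N - K, n - k) for k in range(4)) / comb(N, n)
print(p)   # ~0.716
```

If "3 of the same cards" instead means three copies of one specific card, the same formula applies with K set to that card's count.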
Hello! I'm having a discussion with my gf over a probability question, but we can't reach a conclusion, so I hope someone here can help.
Let's say you have a 0.6% chance to succeed at an action, and you perform that action 74 times. What is the total % chance of the action succeeding within those 74 tries?
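Assuming the tries are independent, the standard route is to compute the chance of 74 straight failures and subtract from 1:

1 - (1 - 0.006)^74 = 1 - 0.994^74 ≈ 1 - 0.640 = 0.360,

so roughly a 36% chance of at least one success. (The tempting shortcut of 74 * 0.6% ≈ 44.4% overcounts, because it double-counts the cases with more than one success.)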
Need YouTuber suggestions for studying probability as a master's student.
Seeking Advice on Theoretical vs. Applied Probability Courses for Future Use in ML & AI
Dear Probability Community,
I’m a 25-year-old Computer Science student with a deep passion for math—or, more accurately, for truly understanding problems through a mathematical lens. Before diving into the world of Machine Learning, Deep Learning, and AI, I wanted to build a strong mathematical foundation. So, while on exchange, I enrolled in a Probability Theory course that’s heavily inspired by Rick Durrett’s Probability: Theory and Examples and approaches probability from a measure-theoretic perspective.
While the course is fascinating, it’s also incredibly challenging. I haven’t studied pure math in about two years, and this course is proving to be a tough re-entry. The theoretical focus is intense, with learning objectives including:
On the other hand, I recently found out that my home university’s Probability course uses Probability by Jim Pitman, which takes a more applied approach. This course emphasizes:
Pitman’s approach seems much more accessible and focused on applied techniques. It’s almost entirely without measure theory, which feels like a relief compared to Durrett’s heavily theoretical course. So, here’s my question:
Given that my long-term goal is to apply probability in areas like stochastic simulation, ML, and AI, does it make sense to push through the theoretical content in Durrett’s book, hoping it will make applied probability easier down the line? Or should I transition to the more applied approach in Pitman’s book to focus on techniques that may be more directly useful?
Thank you for any insights you can offer—I’m grateful for any advice or personal experiences you may have!
Like the title says: the issue I'm seeing is someone missing a hit in a trading card box, where the hit rate is usually around 1 in 3-4 cases, and they are on roughly case 21 without hitting even one. At what point is it fishy that he hasn't gotten the pull? Or at what point is it statistically impossible to miss that much?
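Reading "1 in 3-4 cases" as a per-case hit probability between 1/4 and 1/3, and assuming cases are independent, the chance of 21 straight cases with no hit is

(1 - 1/3)^21 ≈ 0.0002 (about 1 in 5,000) and (1 - 1/4)^21 ≈ 0.0024 (about 1 in 420).

Never literally impossible, but fishy well before case 21: at those odds, explanations like a wrong assumed hit rate or non-random case allocation start looking more likely than a genuine streak.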
So I have a variation on the previous thread here. Suppose I'm referring factory workers for interviews, and the company will hire any given one with probability P (all independent). Human Resources over there keeps track of how many get hired from the last N ones I refer, and I get a bonus if at least X of those previous N interviewees get hired. How many interviews should we expect to occur before I get that bonus?
e.g., suppose P=40%, bonus paid if 50 of the last 100 get hired. The binomial distribution can tell me the odds of that being the case for any given new group of 100 interviews - it's a straightforward calculation (odds X>=50 here is 2.71%). But here, we're preserving knowledge, a buffer of the last 100 interviewees, and keeping a running count of how many were hired. So while that last-100 ratio will average 40 (P*N), and will go up and down over time in a predictable distribution, at some point it will reach my bonus threshold of X. So, how many interviews should we expect to occur before that threshold is cleared?
I've been thinking about each incremental interview as essentially representing a new group of 100 (so our first test is against interviews 1-100, but the next consideration is against interviews 2-101, then 3-102, etc), except each set of 100 trials isn't independent - it is 99/100ths the same as the previous one. So I'm not sure how to properly account for the "trailing history" aspect of the scenario here. Any advice?
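Exact analysis is awkward precisely because of that trailing-history correlation; this is a scan-statistic (sliding-window) hitting-time problem, and closed forms are rare. A Monte Carlo sketch is a reasonable first step:

```python
import random
from collections import deque

# Simulate Bernoulli(P) hires, track the last N outcomes, and report how
# many interviews pass before the rolling count first reaches X.
def interviews_until_bonus(P=0.4, N=100, X=50, cap=10_000_000):
    window, hits = deque(), 0
    for i in range(1, cap + 1):
        hired = random.random() < P
        window.append(hired)
        hits += hired
        if len(window) > N:
            hits -= window.popleft()
        if len(window) == N and hits >= X:
            return i
    return None   # threshold never reached within the cap

runs = [interviews_until_bonus() for _ in range(200)]
runs = [r for r in runs if r is not None]
print(sum(runs) / len(runs))   # rough mean waiting time
```

Note that the 2.71% per-window figure can't simply be inverted into a geometric waiting time, because overlapping windows are strongly correlated; the simulation sidesteps that.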
OK, imagine you're playing roulette, or rolling dice, or whatever you want. If you have a 30% chance of hitting the winning outcome, then to break even (let's say you're playing for money) you need to hit the winning outcome X times over Y rounds. Each round is an independent event. For simplicity, let's assume that you need to hit 50 times over 200 rounds.
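The post seems to trail off before the actual question, but the break-even chance for these numbers is a plain binomial tail. A sketch, assuming the quantity wanted is P(at least 50 wins in 200 independent rounds at 30%):

```python
from math import comb

n, p, x = 200, 0.3, 50
prob = sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(x, n + 1))
print(prob)   # ~0.95: the mean is 60 wins, so 50+ happens most of the time
```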
I was playing around with making up a new card game while doing some worldbuilding, and it dawned on me that I'm not sure how to figure out the order of winning hands. I ran 20 rounds of test hands and realized I'd need a lot more data, and then remembered that math, and people who are good at it, exist. Also, if there's a better sub for this, please let me know!
The game uses a deck of 40 cards, each suit has an ace through 10.
The dealer deals 2 cards to each of 4 players, then flips a card face up in the middle. The players have a chance to swap out one card before hands are shown and a winner declared.
The win conditions in order are:
Triple - both cards in the player's hand match the center card
Pair - the player holds two of the same cards
Match - one card in the player's hand matches the center card
10 in All - The player's hand plus the center card totals 10 (so like they hold a 2 & 5 and the center card is a 3)
Totals 10 - the two cards in the player's hand add to 10 (so like 8 & 2 or 7 & 3)
Holds 10 - the player has a 10 in their hand
Does that hold up as the order from rarest combination to least rare? And does this game already exist, meaning I spent an hour dealing cards to myself for nothing? lol
Thank you so much for any light you can shed!
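The deck is small enough to settle this exactly: C(40,2) = 780 possible hands times 38 possible center cards is under 30,000 cases. A sketch that counts each win condition, assuming aces count as 1, ignoring the swap step, and applying the conditions in the listed priority to a single player's dealt hand:

```python
from itertools import combinations

# 40-card deck: ranks 1 (ace) through 10 in four suits.
deck = [(rank, suit) for rank in range(1, 11) for suit in range(4)]
counts, total = {}, 0

for hand in combinations(deck, 2):
    (r1, _), (r2, _) = hand
    for center in deck:
        if center in hand:
            continue
        rc = center[0]
        total += 1
        if r1 == rc and r2 == rc:      cond = "triple"
        elif r1 == r2:                 cond = "pair"
        elif rc in (r1, r2):           cond = "match"
        elif r1 + r2 + rc == 10:       cond = "10 in all"
        elif r1 + r2 == 10:            cond = "totals 10"
        elif 10 in (r1, r2):           cond = "holds 10"
        else:                          cond = "nothing"
        counts[cond] = counts.get(cond, 0) + 1

for cond, c in sorted(counts.items(), key=lambda kv: kv[1]):
    print(f"{cond}: {c / total:.4%}")
```

Printing in ascending frequency shows whether the listed order really runs rarest to most common. The swap step and the other three players' hands shift the in-game numbers, but the raw-deal ranking is the natural starting point.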
My friend has seen Justin Tucker miss 4 of the 9 field goal attempts he's watched in person. Justin Tucker has only missed 47 of 455 attempts in his career. What is the probability of someone seeing that many misses in so few attempts?
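Treating each kick as independent with the career miss rate (a simplification: attempts differ in distance and conditions, and the friend's 9 attempts weren't randomly sampled), the chance of seeing 4 or more misses in 9 attempts is a binomial tail:

```python
from math import comb

# P(4+ misses in 9 attempts) with per-kick miss probability 47/455.
p = 47 / 455
prob = sum(comb(9, k) * p**k * (1 - p) ** (9 - k) for k in range(4, 10))
print(prob)   # ~0.009, i.e. roughly a 1% chance
```

Rare, but with many fans each watching their own handful of games, a few are bound to witness a streak like this.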
In the book "How to Get Lucky" by Allen D. Allen, he cites a modification of the Monty Hall problem, but it doesn't make sense:
"The Horse Race
Here’s an example of how that can happen for the extended form of magic explained in this chapter. It uses a large group of gamblers who make a definitive, unambiguous decision as a whole by creating a runaway favorite for a horse race. Suppose as the horses head for the finish line, the favorite is in the lead, but another horse has broken out and is close to overtaking the favorite. The horse ridden by the jockey in blue is the favorite and is in the lead. The horse ridden by the jockey in orange has advanced to the “place” position (second position). At this instant in time, the horse in second position has the greatest chance of winning the race. The more horses there are in the race, the more the horse ridden by the jockey in orange is likely to win, as shown in Table I, above. In other words, because the horse ridden by the jockey in blue is the runaway favorite, that horse is the original bet, like the card that first gets the coin. Because the horses not shown have no chance of winning if the two shown are close enough to the finish line, the other horses are like the cards that have been turned over in Figure 6. (Of course, the two leading horses have to be far enough from the finish line for the horse in second place to have time to overtake the favorite, at least a few seconds.) Therefore, betting on the horse ridden by the jockey in orange at this point in time is like moving the coin. But there is a cautionary note here. The number of horses deemed to be in the race for the purpose of determining the number of choices (the first column on the left in Table I) must only include horses that could possibly win the race before the gate opens. Often, this is all the horses in the field, which is why the word “horse race” is usually deemed synonymous for the phrase “anything can happen.” On the other hand, experienced handicappers might consider some of the horses in a race incapable of winning. Unfortunately, you can’t place a bet at the window once the race has begun, much less seconds before the race has finished. But if you were at the track with a buddy who hasn’t read this book, then maybe he’d take your quick bet that the favorite isn’t going to win even though that colt or filly is in the lead only seconds before the race ends.
TLDR: He says that if you bet on a horse before the start of a race with 100 horses, it has a 1/100 chance; but when, close to the finish, we see this horse alongside one other, that other horse has a 99/100 chance, because the remaining horses are at the back (they are out of the race), while your chosen horse still has a 1/100 chance.
My understanding: he is wrong, and both horses are at 50/50. He misunderstood the Monty Hall problem, because there the showman is never going to open your door (meaning that if you bet on a horse, it will always be among the final 2), which does not apply to the horse race, because here your horse can fall out of contention.
Please help me, am I wrong???
I don't know how to get the exact probability. I posted this to r/statistics and it got deleted.
I was playing cards with my GF and her family. We were playing Liar, a game where all the cards are dealt to the players. There were 4 players. My hand was exactly one of each rank (random suits): 2, 3, 4, 5, 6, 7, 8, 9, 10, J, Q, K, A.
What are the odds of that happening?
I plugged the question into ChatGPT for help, but I think it's not smart enough yet.
To calculate the odds of being dealt exactly one of each rank (from Ace to King) when four players are dealt cards from a standard 52-card deck, we need to consider how the cards are distributed.
Total Cards and Distribution: There are 52 cards, which consist of 13 ranks (Ace, 2, ..., 10, Jack, Queen, King). Each player receives 13 cards.
Exact Distribution Condition: We want to find the probability that each player receives exactly one card of each rank.
Total Ways to Distribute Cards: The total ways to distribute 52 cards to four players (13 cards each) can be calculated using the multinomial coefficient:
52! / (13!)^4
Favorable Outcomes: To find the number of favorable distributions where each player gets exactly one of each rank, we can consider the ranks first. Each rank has 4 different suits. For each of the 13 ranks, we need to choose which suit goes to which player.
For each rank, there are (4!) (24) ways to assign the 4 suits to the 4 players. Therefore, for 13 ranks, the number of favorable distributions is:
(4!)^13 = 24^13
Calculating the Probability: Now, we can find the probability (P) of the favorable outcome:
P = (4!)^13 / (52! / (13!)^4) = 24^13 * (13!)^4 / 52!
Calculating Odds: The odds are typically expressed as the ratio of favorable outcomes to unfavorable outcomes:
Odds = P / (1 - P)
Given the complexity of directly computing these factorials, the specific numerical odds can be challenging to compute without a calculator or computational software. However, the steps above lay out the framework for determining the odds of being dealt exactly one of each card rank when four players are dealt from a full deck.
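Note that the framework above answers a different question: the chance that all four players simultaneously end up with one card of each rank. For the question as asked, one specific player's 13 cards containing each rank exactly once, the count is simpler: pick a suit for each of the 13 ranks and divide by the number of possible 13-card hands:

```python
from math import comb

# P(a given 13-card hand has exactly one card of each rank).
p = 4 ** 13 / comb(52, 13)
print(p, 1 / p)   # ~0.000106, about 1 in 9,460 deals
```

So a hand like that shows up roughly once in nine and a half thousand deals: genuinely rare, but not astronomical.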
So recently in a probabilistic systems analysis class we took a test, and the question worth the most points was weird. I got 0.95, and so did many others, but after the class the teacher went through the test and gave the answer as 0.91. I can't find anything wrong with either solution. When I asked the teacher, he said I must have not taken something into account (he was giving off a "figure it out yourself" vibe). So my problem is that I have no idea whether my solution is wrong, because it is so simple.
The problem:
One of two suspects (A, B) committed the crime. Before the blood evidence, the chances of each being guilty were equal (50/50). At the crime scene, the criminal's blood was found; its type occurs in only 10% of the population. Suspect A's blood is a match, and suspect B's is unknown. From this information, find the probability that A is the criminal.
Teacher's solution:
Say A means A is guilty, B means B is guilty, and C means that A's blood was a match
P(A∣C): the probability that A is the culprit given that there is a blood match.
P(C∣A): The probability of a blood match given that A is the culprit. = 1
P(A∣C)= P(C∣A)⋅P(A) / ( P(C∣A)⋅P(A)+P(C∣B)⋅P(B) ) = 1 * 0.5 / (1 * 0.5 + 0.1 * 0.5) = 0.90909...
I do not see anything wrong with this and it seems to be correct.
My solution:
Say A means A is guilty, and B means B's blood was a match
P(A|B̄): the probability of A being the criminal given that B's blood does not match = 1
P(A|B) = P(Ā|B): the probability of A being (or not being) the criminal given that B's blood does match = 0.5
P(B): the probability of B's blood matching = 0.1
P(A): the probability of A being the criminal
P(A) = P(A|B̄)⋅P(B̄) + P(A|B)⋅P(B) = 1 ⋅ 0.9 + 0.5 ⋅ 0.1 = 0.95
If B's blood does not match, A is guilty by default; that happens 90% of the time. If B's blood does match, we are back to square one and the chances are 50/50. This is so simple I can't see how it could be wrong.
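One way to arbitrate between 0.95 and 0.91 is to simulate the model exactly as stated and condition only on what was actually observed, namely that A's blood matches (B was never tested). A minimal sketch:

```python
import random

# Model: the guilty suspect is A or B with probability 1/2 each; the guilty
# suspect's blood always matches the scene; an innocent suspect's blood
# matches with probability 0.1. Condition on the observation: A matches.
matches = guilty_and_matches = 0
for _ in range(1_000_000):
    a_is_guilty = random.random() < 0.5
    a_match = True if a_is_guilty else (random.random() < 0.1)
    if a_match:
        matches += 1
        guilty_and_matches += a_is_guilty

print(guilty_and_matches / matches)   # ~0.909, the teacher's 10/11
```

The simulation lands on 10/11. The subtle issue in the 0.95 route is P(B) = 0.1: under the model, B's blood matches with certainty whenever B is guilty, so the unconditional match probability is 0.5 * 1 + 0.5 * 0.1 = 0.55, not 0.1, and in any case B's match status was never observed, so it isn't available to condition on.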