prolineplayer.com - The Forum

Posted: **Tue Sep 10, 2013 10:20 am**

matty, dm me your gmail on twitter and ill pop you some data. believe it or not, the window of data is relevant because pitchers go on hot and cold streaks but there is a functional relationship between periodicity of data and predictive efficiency of the historical data set. to be fair i didnt explain this: you should understand we take the last 90 innings and then use that data to model a distribution which is used in a copula driven monte carlo, so really we are extrapolating hundreds of thousands of data points (random inning stats vectors) from a suggested generator data set built from the last "x" innings. we do this, critically, to mimic the stochastic behavior of the on field stats with bound random numbers.

x = 45 innings
x = 90 innings
x = 180 innings

as x gets really big the data you use to generate expected numbers gets worse after a point and using a simple fminsearch func in matlab finds us optimal x.

you do this to get a good set of 500k or so random vectors bound by the machine learning model parameters and plug those into the machine learning algo which will then do its thing and classify results which are cast into WE's

so to sum up A) we arent bound by a small sample size, we are modelling baseball as a sthocastic process and using the work of our peers in machine learning to classify the results.

this is pretty much standard modelling work in investment banking these days in the derivative market.

caveat: some people blame Gaussian Copula for the 2008 financial collapse, its funny to google it and see people rant and rave about a math formula.

Posted: **Wed Mar 25, 2015 6:40 pm**

Im back and this time I am armed with a crap ton of code and data to help win in MLB.

here is where I live. line-breaker.tumblr.com

I am developing a pitch physics method from the pitchFX using some data science techniques to develop different methods of getting +EV. Controversially, I also am developing another model that completely decouples from odds and just picks winners. I am researching how far this can be taken given the rich hitfx, fieldfx and pitchfx we can collect.

I also claim that starting pitcher is almost all that matters in making +EV bets with bullpen as a more minor shift (obviously they pitch less innings).

Disclaimer: I believe +EV is always the best way to think but also claim that winner only strategy can be almost as good as value picking if not better. All of my research is at that blog.

start from post #1 http://line-breaker.tumblr.com/post/107 ... -the-first

I'm going to share all my research and picks here the day before and eventually give access to using my codebase online. Only the data matters.

If I flame out, well great! If I hit it like I have been, then lets talk more GD2 data and pitch physics.

see you all april 5th.

Cheers.

Posted: **Mon Mar 30, 2015 5:23 pm**

This will be a good one. Positive vibe on this for me already.

Posted: **Thu Apr 23, 2015 12:28 pm**

It should be noted that this guy is a tout (he sells picks). Almost all touts are basically scams and a waste of money.

prolineplayer.com - The Forum

Steamsharp Chronicles.

Re: Steamsharp Chronicles.

Re: Steamsharp Chronicles.

Re: Steamsharp Chronicles.

Re: Steamsharp Chronicles.