I had a startup a few years ago that was in the “eh we’ve got some money left from our BigTech days, let’s buy a lottery ticket that’s also a masters degree” category.
And in late 2018, attention/transformers was quite the risqué idea. We were trying to forecast price action in financial markets, and while it didn’t work (I mean really Ben), it smoked all the published stuff like DeepLOB.
It used learned embeddings of raw order books, passed through a little conv widget to smooth a bit, and then learned embeddings of the resulting order book states, before running them through bog-standard positional encoding and multi-head masked self-attention.
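Roughly, in PyTorch, a stack like the one described might look as follows. Every dimension, the conv shape, and the three-class head are my assumptions for illustration, not the actual model:

```python
# Sketch of: raw order books -> small conv (smoothing) -> per-state embedding
# -> sinusoidal positional encoding -> causally masked self-attention.
import math
import torch
import torch.nn as nn

class OrderBookForecaster(nn.Module):
    def __init__(self, book_levels=20, d_model=128, n_heads=8, n_layers=4, max_len=512):
        super().__init__()
        # Smooth each raw book snapshot (price/size channels per level).
        self.conv = nn.Conv1d(in_channels=2, out_channels=d_model, kernel_size=3, padding=1)
        # Fixed sinusoidal positional encoding over the sequence of book states.
        pe = torch.zeros(max_len, d_model)
        pos = torch.arange(max_len).unsqueeze(1).float()
        div = torch.exp(torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model))
        pe[:, 0::2] = torch.sin(pos * div)
        pe[:, 1::2] = torch.cos(pos * div)
        self.register_buffer("pe", pe)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, 3)  # assumed: down / flat / up logits

    def forward(self, books):  # books: (batch, seq, 2, levels)
        b, t, c, l = books.shape
        # Conv each snapshot independently, pool over levels -> one embedding per state.
        x = self.conv(books.reshape(b * t, c, l)).mean(dim=-1).reshape(b, t, -1)
        x = x + self.pe[:t]
        # Causal mask so each position only attends to past book states.
        mask = nn.Transformer.generate_square_subsequent_mask(t).to(x.device)
        return self.head(self.encoder(x, mask=mask))
```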
This actually worked great!
The thing that kills you is trying to reward-shape on the policy side to avoid getting eaten by taker fees; with artificially lowered fees it's a broken ATM.
Interesting, I'm trying to understand (much less knowledgeable about finance than ML, heh). But it sounds like you fed it the raw order books (no time dimension), plus a sequence of order states corresponding to each (a time series), mapped them into the embedding dimension of a decoder-only transformer (hence the masking), and trained it to predict logits for the next order state?
See, that makes way more sense to me, since it sounds like you used causal self-attention, and actual position embeddings.
I've been interested in some time series stuff, like position embeddings to model actual wall-clock time offsets rather than sequence index, but for textless NLP rather than trading.
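The wall-clock variant is just computing the sinusoidal angles from real time offsets instead of integer positions, so irregularly spaced events get embeddings that reflect their actual gaps. A toy numpy sketch (all constants made up):

```python
import numpy as np

def time_encoding(timestamps, d_model=64, max_period=10_000.0):
    """Sinusoidal encoding from wall-clock offsets (seconds), not sequence index."""
    t = np.asarray(timestamps, dtype=np.float64) - timestamps[0]  # offsets from first event
    i = np.arange(d_model // 2)
    freqs = 1.0 / (max_period ** (2 * i / d_model))
    angles = t[:, None] * freqs[None, :]
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)

# Two events 0.5s apart get nearby encodings; events 60s apart do not,
# regardless of how many events happened in between.
enc = time_encoding([0.0, 0.5, 60.0, 60.01])
```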
They did say it didn't work. The overwhelming majority of finance stuff in published work doesn't work, because it's either too simplistic, poorly backtested, or exploited too quickly once live, so beating those papers doesn't imply you can run a hedge fund.
The main point is that it's one thing to predict price action and another to trade profitably - in particular, they weren't able to beat fees, which is a common hurdle if you're new to HFT.
Basically this. We were heavy infra pros and my cofounder was an HFT veteran, so it wasn't classic implementation shortfall so much as that we never solved the “do we enter” threshold on what would otherwise have been a friction-free windfall.
What they describe looks like a single predictor. You can't create a strategy with a single predictor, unless it's incredibly predictive. 99% of the time, a predictor cannot beat its transaction costs alone.
You need to combine hundreds of such predictors to be able to beat costs and have a net profitable strategy.
We have a saying in French that you need a lot of rivers to create a sea.
So the group involved veterans from like Knight and DRW and stuff: we understood the model of combining lots of small signals with a low-latency regression.
We were trying to learn those signals as opposed to sweat-shop them.
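For anyone outside the field, that combination step is conceptually just a regularized linear regression over hundreds of weak signals, trading only when the combined forecast clears fees. A toy sketch, with every signal and number invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
n_ticks, n_signals = 100_000, 300
# Hundreds of weak predictors, each barely correlated with the next return.
true_return = rng.normal(0, 1e-4, n_ticks)
signals = true_return[:, None] * rng.normal(0, 1, n_signals) \
          + rng.normal(0, 1e-3, (n_ticks, n_signals))

# Fit the linear combiner on the first half; ridge keeps 300 weights stable.
half = n_ticks // 2
X, y = signals[:half], true_return[:half]
w = np.linalg.solve(X.T @ X + 1e-3 * np.eye(n_signals), X.T @ y)
pred = signals[half:] @ w

# Trade only when the combined forecast clears an assumed taker fee.
fee = 1e-5
pnl = np.where(np.abs(pred) > fee, np.sign(pred) * true_return[half:] - fee, 0.0)
print(f"mean per-trade edge after fees: {pnl[pnl != 0].mean():.2e}")
```

No single column of `signals` here beats the fee on its own; the ridge-combined forecast can, which is the whole point of the rivers-into-a-sea saying.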
Wasn't the US housing crisis of the late 2000s caused by that 99% threshold?
Not in finance at all, but I do use reverse Kalman filters, which seem similar in core concept.
While reverse Kalman filters are incredibly helpful in reducing cloud spend by predicting when to autoscale, you still need metrics that let you recover quickly from mistakes.
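For concreteness ("reverse Kalman filter" isn't a standard term, so this sketches a plain constant-velocity Kalman filter instead), here is one minimal way to forecast load for autoscaling; all constants are illustrative:

```python
import numpy as np

def kalman_load_forecast(observations, q=1e-2, r=1.0, horizon=5):
    # State: [load, load_velocity]; constant-velocity process model.
    F = np.array([[1.0, 1.0], [0.0, 1.0]])  # state transition
    H = np.array([[1.0, 0.0]])              # we observe load only
    Q = q * np.eye(2)                        # process noise
    R = np.array([[r]])                      # measurement noise
    x = np.array([observations[0], 0.0])
    P = np.eye(2)
    for z in observations:
        # Predict step.
        x = F @ x
        P = F @ P @ F.T + Q
        # Update step.
        K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)
        x = x + K @ (np.array([z]) - H @ x)
        P = (np.eye(2) - K @ H) @ P
    # Extrapolate `horizon` steps ahead to decide whether to scale now.
    return (np.linalg.matrix_power(F, horizon) @ x)[0]

load = [100, 104, 110, 118, 128, 140]  # requests/sec samples
print(kalman_load_forecast(load))       # trend says scale up before demand hits
```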
Based only on tech interviews with HFT companies, I would assume someone else could use these same methods on historical data to predict your moves.
But perhaps I am just too risk averse or am missing the core concept.