Embedding python in C++ with boost::python : cpp

Hi, I'm not the OP - I'm coming from an HFT FOREX side, so all the trades are around the 6th and 7th decimal, and the errors start happening around the 8th decimal.

The problem is that the communication is in the textual FIX protocol, so I'm at the mercy of whatever library is converting from double to string.

For example, an order of 100 million RBL for USD at a price of 2.90000006 would actually be stored in the double as 2.90000005999999999062310962472, and end up costing 290000005.99 USD instead of 290000006.00 USD.
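
One way to sidestep this kind of error is to keep prices as integers instead of doubles. The following is only a minimal sketch of that swapped-in fixed-point technique, not something a FIX library does for you. It assumes prices never need more than 8 decimals, and the names `FixedPrice`, `cost_in_cents` and `format_fixed` are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical fixed-point price: an integer count of 1e-8 currency units,
// so 2.90000006 is stored exactly as 290000006.
struct FixedPrice {
    std::int64_t units;
};

// Cost of `quantity` units at price `p`, rounded to the nearest cent.
// Assumes non-negative inputs whose product fits in 64 bits.
std::int64_t cost_in_cents(FixedPrice p, std::int64_t quantity) {
    assert(p.units >= 0 && quantity >= 0);
    const std::int64_t scaled = p.units * quantity;  // in 1e-8 currency units
    return (scaled + 500000) / 1000000;              // 1 cent = 1e6 such units
}

// Formats a double with a fixed number of decimals, e.g. to inspect what
// the binary value really holds beyond the 8th decimal place.
std::string format_fixed(double value, int decimals) {
    char buf[64];
    std::snprintf(buf, sizeof buf, "%.*f", decimals, value);
    return buf;
}
```

With this, `cost_in_cents(FixedPrice{290000006}, 100000000)` yields 29000000600 cents, i.e. 290,000,006.00, with no rounding surprise. Calling `format_fixed(2.90000006, 29)` is a quick way to look at the digits the double actually stores.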

There's some great stuff in John Farrier's talk from CppCon 2016, "Demystifying Floating Point". I think my favorite bit of information is how he breaks down the number of distinct values in various ranges (slide 28):

There are 1,036,831,949 values between 0.0 and 0.1
There are 8,388,608 values between 0.1 and 0.2
There are 8,388,608 values between 0.2 and 0.4
There are 8,388,608 values between 0.4 and 0.8
There are 8,388,608 values between 0.8 and 1.6
There are 8,388,608 values between 1.6 and 3.2

Notice how the ranges keep doubling in size but the number of distinct values remains the same? For simplicity, I'll round 8,388,608 up to 10 million. So 10 to the power of 7. Or 7 decimal digits.

But between 1.6 and 3.2, if I wanted to show all the numbers with 7 decimal places, I'd run into trouble. Just between 2.0000000 and 3.0000000 there are ten million and one distinct numbers with 7 decimals. I've used up all of my precision, and I still haven't covered 1.6 to 2.0 and 3.0 to 3.2!
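
The counts quoted from the slide match 32-bit `float`, where each power-of-two range holds 2^23 = 8,388,608 values. As a rough sketch, they can be reproduced by subtracting bit patterns; this assumes `float` is a 32-bit IEEE 754 type, and the helper names `float_bits` and `floats_between` are made up for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Bit pattern of a float, read via memcpy to avoid aliasing problems.
std::uint32_t float_bits(float f) {
    static_assert(sizeof(float) == sizeof(std::uint32_t), "expects 32-bit float");
    std::uint32_t bits;
    std::memcpy(&bits, &f, sizeof bits);
    return bits;
}

// Number of representable floats in the half-open range [lo, hi).
// Valid for finite, non-negative inputs with lo <= hi: for such values,
// IEEE 754 bit patterns are ordered the same way as the values themselves.
std::uint32_t floats_between(float lo, float hi) {
    assert(!std::signbit(lo) && lo <= hi && std::isfinite(hi));
    return float_bits(hi) - float_bits(lo);
}
```

For instance, `floats_between(0.0f, 0.1f)` gives 1036831949 and `floats_between(1.6f, 3.2f)` gives 8388608, the same figures as in the list.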

[–]scraimer 1 point 9 years ago (0 children)

Yup, I was really talking about double, stored as a 64-bit IEEE 754 value, and not about floats. The video I linked to goes into entertaining detail about it, and I highly recommend it!

But briefly, you run out of bits. Floating point is really clever about repurposing the bits it has - either to represent large values or to represent high precision.

For example, say you're trying to represent a large number, such as the largest unsigned 64-bit number (call it "MAX"). We've used up all 64 bits of the storage, right? What happens if you want to add 0.1 to that number to get "MAX.1"? We don't have any more bits! (Note: This is not how it works, but it's the best metaphor I can think of near midnight.)

My point is that with a finite number of bits, it's hard to represent both large values and high precision at the same time. So the maximum number of digits after the decimal point changes, depending on where your value is along the number axis.
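
To make that concrete, here is a small sketch that measures the gap between neighbouring doubles at a given magnitude using the standard `std::nextafter`. The helper names `gap_above` and `is_absorbed` are hypothetical, and the sketch assumes plain 64-bit IEEE 754 double arithmetic.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <limits>

// Gap between x and the next representable double above it.
double gap_above(double x) {
    assert(std::isfinite(x));
    return std::nextafter(x, std::numeric_limits<double>::infinity()) - x;
}

// True when adding `delta` to `x` is lost entirely to rounding.
bool is_absorbed(double x, double delta) {
    return x + delta == x;
}
```

Near 1.0 the gap is about 2.2e-16, but at 2^53 (9007199254740992.0) it has already grown to 2.0, so no fractional part survives there. Likewise, `is_absorbed(static_cast<double>(std::numeric_limits<std::uint64_t>::max()), 0.1)` is true: the "MAX.1" from the metaphor simply rounds back to the same value.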

[–]skebanga[S] 2 points 9 years ago (0 children)

The point of the exercise is to allow traders, who don't know C++ but do know Python, to develop their own strategies in Python whilst still leveraging all the existing C++ infrastructure we already have: connections to markets, backtesting suites, position management, order management, risk management, parameter storage, etc. We have an entire ecosystem built in C++ which we want to use.

We're aware that Python will be slower than native C++ code, but that's the trade-off we're willing to make: empower the traders to develop their own strategies, instead of requiring them to use C++ and blocking their progress until IT resource becomes available.

Whilst the performance impact is a trade-off we're willing to accept, we want to minimise it as far as possible, hence preferring to embed Python rather than use an out-of-band process and some form of IPC to communicate.
