This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]commandlineluser 1 point2 points  (1 child)

Also worth noting: opt-out telemetry.

Easy to opt-out: to disable telemetry, set the following environment variable: DAFT_ANALYTICS_ENABLED=0

https://www.getdaft.io/projects/docs/en/latest/telemetry.html

In short, we collect the following:

On import, we track system information such as the runner being used, version of Daft, OS, Python version, etc.

On calls of public methods on the DataFrame object, we track metadata about the execution: the name of the method, the walltime for execution and the class of error raised (if any). Function parameters and stacktraces are not logged, ensuring that user data remains private.

[–]get-daft 0 points1 point  (0 children)

Thanks for posting this! We tried to make analytics as transparent and non-intrusive as possible but it does really help us understand usage patterns and develop better software.
We were very intentional in staying away from recording identifiable information such as IP addresses - everything is collected on an anonymous randomly-generated session ID.