Scaling Python and PySpark using Vectorized UDFs and Apache Arrow

Li Jin, Distributed System Engineer, Two Sigma Investments, presented this at the 13 June 2018 STAC Summit in New York.

Download the slides below.

NOTE: Some or all of the content on this page and its attachment(s) were supplied by a party other than STAC. STAC does not endorse the content. No performance claims are supported by STAC except those found in an official STAC Report of results audited by STAC.