#data-science
Read more stories on Hashnode
Articles with this tag
With multi-step process. ยท There are times when you want an user-defined function (UDF) to return multiple values. You can return them as either a...
Recently had to perform a kind of manual unpivoting and mapping of every row value to separate dataframe with specific schema. Pyspark supports...
There was some uncertainty on whether or not the following line actually interprets returns back a dataframe that has the correct value. All testing...
The integration of Spark and Delta Lake tables is seamless and smooth for the most part. Ran into some issues with unit-tests concerning the creation...
Process has been tested through Mint and Ubuntu distros. Prep Update and upgrade packages: sudo apt update && sudo apt upgrade Install Java: sudo apt...