
- python - PySpark: "Exception: Java gateway process exited before ...- I'm trying to run PySpark on my MacBook Air. When I try starting it up, I get the error: Exception: Java gateway process exited before sending the driver its port number when sc = … 
- python - Spark Equivalent of IF Then ELSE - Stack Overflow- python apache-spark pyspark apache-spark-sql edited Dec 10, 2017 at 1:43 Community Bot 1 1 
- Pyspark: get list of files/directories on HDFS path- Mar 2, 2016 · As per title. I'm aware of textFile but, as the name suggests, it works only on text files. I would need to access files/directories inside a path on either HDFS or a local path. I'm … 
- Comparison operator in PySpark (not equal/ !=) - Stack Overflow- Aug 24, 2016 · Comparison operator in PySpark (not equal/ !=) Asked 9 years, 2 months ago Modified 1 year, 8 months ago Viewed 164k times 
- pyspark - How to use AND or OR condition in when in Spark- 107 pyspark.sql.functions.when takes a Boolean Column as its condition. When using PySpark, it's often useful to think "Column Expression" when you read "Column". Logical operations on … 
- pyspark : NameError: name 'spark' is not defined- Alternatively, you can use the pyspark shell where spark (the Spark session) as well as sc (the Spark context) are predefined (see also NameError: name 'spark' is not defined, how to solve?). 
- How do I replace a string value with a NULL in PySpark?- Mar 7, 2023 · I want to do something like this: df.replace('empty-value', None, 'NAME') Basically, I want to replace some value with NULL, but it does not accept None as an argument. How can … 
- pyspark: rolling average using timeseries data - Stack Overflow- Aug 22, 2017 · pyspark: rolling average using timeseries data Asked 8 years, 2 months ago Modified 6 years, 2 months ago Viewed 77k times 
- Pyspark: Parse a column of json strings - Stack Overflow- I have a pyspark dataframe consisting of one column, called json, where each row is a unicode string of json. I'd like to parse each row and return a new dataframe where each row is the … 
- Pyspark: explode json in column to multiple columns- Jun 28, 2018 · Pyspark: explode json in column to multiple columns Asked 7 years, 4 months ago Modified 7 months ago Viewed 87k times