The advice on PySpark logging in the question "How to turn off INFO logging in PySpark?" is not working for me: no logging is happening.
I am familiar with setting up logging in PySpark on a locally built Spark, but I am now using Cloudera's Spark. I have set up a RollingFileAppender in
$SPARK_HOME/log4j.properties
which is the correct thing to do according to the docs:
http://spark.apache.org/docs/1.2.0/configuration.html#configuring-logging
Configuring Logging
Spark uses log4j for logging. You can configure it by adding a log4j.properties file in the conf directory. One way to start is to copy the existing log4j.properties.template located there.
But the configuration is not taking effect: no log files are created in the destination directory.
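
For context, here is a minimal sketch of the kind of RollingFileAppender setup described above, assuming log4j 1.x property syntax. The appender name (rolling), the file path, and the size limits are illustrative placeholders, not the exact values from my file.

```properties
# Route root-level INFO messages to a rolling file appender.
# "rolling" is an arbitrary appender name chosen for this sketch.
log4j.rootCategory=INFO, rolling

# log4j 1.x rolling appender; rolls the file over once it reaches MaxFileSize.
log4j.appender.rolling=org.apache.log4j.RollingFileAppender
# Placeholder path: the destination directory must be writable by the Spark process.
log4j.appender.rolling.File=/tmp/spark-logs/spark.log
log4j.appender.rolling.MaxFileSize=50MB
log4j.appender.rolling.MaxBackupIndex=5

# Timestamp, level, short logger name, message.
log4j.appender.rolling.layout=org.apache.log4j.PatternLayout
log4j.appender.rolling.layout.ConversionPattern=%d{yy/MM/dd HH:mm:ss} %p %c{1}: %m%n
```

With a file like this picked up by Spark, I would expect spark.log to appear in the destination directory as soon as the driver starts logging.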