Trying to figure out how to replace a specific column in Pyspark with null values randomly. So changing a dataframe such as this:
| A  | B  |
|----|----|
| 1  | 2  |
| 3  | 4  |
| 5  | 6  |
| 7  | 8  |
| 9  | 10 |
| 11 | 12 |
and randomly change 25% of the values in column 'B' to null values:
| A  | B    |
|----|------|
| 1  | 2    |
| 3  | NULL |
| 5  | 6    |
| 7  | NULL |
| 9  | NULL |
| 11 | 12   |
