pyspark.streaming.DStream.partitionBy# DStream.partitionBy(numPartitions, partitionFunc=<function portable_hash>)[source]# Return a copy of the DStream in which each RDD are partitioned using the specified partitioner.