Cursussen
InloggenBegin met het leren van AI
Cursussen
InloggenBegin met het leren van AI
0

Pyspark glom()

I do understand that it returns RDD coalescing all elements within each partition into a list. What happens when we don’t specify the num of partition, is there is a default? where do we actually use it?

actionpysparktransformationdataengineering
18th Jun 2024, 3:28 AM
Chethana
Chethana - avatar
1 Antwoord
+ 1
Have you tried looking at the documentation? The glom() method does not have any arguments. https://spark.apache.org/docs/latest/api/JUMP_LINK__&&__python__&&__JUMP_LINK/reference/api/pyspark.RDD.glom.html https://stackoverflow.com/questions/24996302/setting-sparkcontext-for-pyspark https://stackoverflow.com/questions/65489387/whats-the-meaning-of-num-slices-parameter-in-sc-parallelize
18th Jun 2024, 4:20 PM
Tibor Santa
Tibor Santa - avatar

Heb je vaak vragen zoals deze?

Leer efficiënter, gratis:

  • Inleiding tot Python

    7,1 miljoen leerlingen

  • Inleiding tot Java

    4,7 miljoen leerlingen

  • Inleiding tot C

    1,5 miljoen leerlingen

  • Inleiding tot HTML

    7,5 miljoen leerlingen

Bekijk alle cursussen
Populair vandaag
What kind of questions do companies ask in Data Analyst interviews (including Python, SQL, Power BI, and Excel)?
1 Votes
??
1 Votes
How many program that you've made even a smaal project are good?
0 Votes
Hello guys,
1 Votes
Help me
1 Votes
Programming
0 Votes
Give some simple practice questions in C
0 Votes
Code coach isn't awarding xp
0 Votes
how do they use javascript
0 Votes
Temperature converter( celsius to fahrenheit)
0 Votes