In the example, the map method is shown taking a lambda with two parameters (key and xy), but the Python version of Spark only has a map method that expects a lambda with a single parameter: each element of a pair RDD is passed in as one (key, value) tuple.
So instead of the following:
r = sumCount.map(lambda key, xy: (key, xy[0]/xy[1])).collectAsMap()
We should use:
r = sumCount.map(lambda kvp: (kvp[0], kvp[1][0] / kvp[1][1])).collectAsMap()
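The same single-parameter pattern can be checked without a Spark cluster: below is a minimal plain-Python sketch that mimics the pipeline, using a hypothetical sum_count list standing in for the (key, (sum, count)) pairs the book's sumCount RDD would hold.

```python
# Stand-in for the sumCount RDD: (key, (sum, count)) pairs.
# The data values here are made up purely for illustration.
sum_count = [("a", (6, 2)), ("b", (9, 3))]

# Single-parameter lambda: kvp is the whole (key, (sum, count)) tuple,
# so the key is kvp[0] and the (sum, count) pair is kvp[1].
averages = dict(map(lambda kvp: (kvp[0], kvp[1][0] / kvp[1][1]), sum_count))
# averages == {"a": 3.0, "b": 3.0}
```

Since only the value part is being transformed, PySpark's mapValues would also work here (sumCount.mapValues(lambda xy: xy[0] / xy[1])), which keeps the key untouched and avoids indexing into the tuple.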