Design doc for parallel_do.md (#8425)
> fc_grad, allreduce(places, scopes, w1_grad),
> fc_grad, allreduce(places, scopes, w2_grad)
> }
> block3 {
I think it's better to indicate each block's parent.
> .AsDuplicable();
> AddInput(kPlaces, "Devices used for parallel processing");
> AddOutput(kOutputs, "Outputs needed to be merged from different devices").AsDuplicable();
> AddOutput(kParallelScopes,
kParallelScopes seems to indicate that there are multiple scopes, but the description says Container, which is a single container:
- does container mean scope?
- is there a single scope or multiple scopes?
- yes, "container" means scope
- there is one scope for each device
Maybe change "container" to "scope" and make "one scope for each device" clear? :)
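To make "one scope for each device" concrete, here is a minimal sketch in plain Python. The `Scope` class is a stand-in for the framework's scope type, not the actual Paddle API; the place names are likewise illustrative:

```python
# Illustrative only: Scope is a stand-in for the framework's scope
# type, not the real Paddle API.
class Scope:
    def __init__(self):
        self.vars = {}  # variable name -> value

places = ["gpu:0", "gpu:1", "gpu:2"]

# parallel_do keeps one child scope per device, so each device's
# copy of the inputs and parameters lives in its own namespace.
parallel_scopes = {place: Scope() for place in places}

for place, scope in parallel_scopes.items():
    scope.vars["w1"] = "copy of w1 on " + place

assert len(parallel_scopes) == len(places)
```

Under this reading, kParallelScopes is the collection of these per-device scopes rather than a single container.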
> ```
> In the forward pass
> | Split input onto different devices
> | Copy parameter to onto different devices
It seems that "Copy parameter to onto different devices" is only done the first time the parallel_do OP runs. Maybe we need to make that clear.
The current version does this at every iteration.
> | Merge output from different devices
>
> In the backward pass
> | Split output@grad onto different devices
Is it split or duplicate?
> | Split output@grad onto different devices
> |||| Compute backward pass in parallel
> | accumulate param@grad from different devices to the first device
> | Merge input@grad from different devices
Is it input@grad or param@grad?
Another step, "Copy param@grad to the place of parallel_do_op", should be added here.
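The backward-pass steps under discussion (split output@grad onto devices, compute the backward pass in parallel, accumulate param@grad on the first device, merge input@grad) can be sketched in plain NumPy. The linear layer `y = x @ w` here is a hypothetical example, not the doc's actual network:

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.standard_normal((4, 3))          # shared parameter
x = rng.standard_normal((8, 4))          # minibatch input
out_grad = rng.standard_normal((8, 3))   # output@grad from upstream

n_dev = 2
x_parts = np.split(x, n_dev)             # forward: split input onto devices
g_parts = np.split(out_grad, n_dev)      # backward: split output@grad the same way

# per-device backward for y = x @ w
w_grads = [xp.T @ gp for xp, gp in zip(x_parts, g_parts)]
x_grads = [gp @ w.T for gp in g_parts]

# accumulate param@grad from different devices on the "first device";
# it would then be copied to the place of parallel_do_op
w_grad = sum(w_grads)

# merge input@grad from different devices
x_grad = np.concatenate(x_grads)

# sanity check: identical to single-device backward
assert np.allclose(w_grad, x.T @ out_grad)
assert np.allclose(x_grad, out_grad @ w.T)
```

The check at the end shows why param@grad is accumulated (summed) while input@grad is merged (concatenated): the parameter is shared across devices, but each device owns a distinct slice of the batch.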
> # get embedding feature on CPU
> feature = some_cpu_only_op(data)
>
> gpu_places = get_place(use_gpu=True)
Can the Python API specify 5 parallel CPU threads when there is no GPU?
doc/design/parallel_do.md
Outdated
> with pd.do():
>     read_input(feature)
>     prediction = my_net(feature)
>     write_output(activation)
write_output(activation) or write_output(prediction)?
typo. Thanks for pointing it out.
>     read_input(feature)
>     prediction = my_net(feature)
>     write_output(activation)
> prediction = pd()
Does the Python API support multiple outputs? If so can you provide an example?
doc/design/parallel_do.md
Outdated
> ```python
> pd = ParallelDo(gpu_places)
> with pd.do():
>     feature = pre_fetch(gpu_places)
Sorry, I don't understand how pre_fetch will work here: since pre_fetch is inside the child block of parallel_do, it will not run until parallel_do runs. Isn't that too late for prefetching?
This op hasn't been implemented yet, but there should be a background thread adding data to the fetching queue before this OP is called.
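A minimal sketch of that idea in plain Python threading: a background producer fills a bounded queue ahead of time, so the consumer (here standing in for pre_fetch inside the parallel_do block) only pops already-loaded batches. `load_batch` is hypothetical, not the actual op:

```python
import queue
import threading

def load_batch(i):
    # stand-in for the real data-loading / I/O work
    return [i] * 4

prefetch_queue = queue.Queue(maxsize=8)

def producer(n_batches):
    # runs in the background, so batches are ready
    # before the parallel_do block asks for them
    for i in range(n_batches):
        prefetch_queue.put(load_batch(i))

t = threading.Thread(target=producer, args=(3,), daemon=True)
t.start()

# inside the parallel_do block, pre_fetch would just pop an
# already-loaded batch instead of doing the I/O itself
batches = [prefetch_queue.get() for _ in range(3)]
t.join()
```

With this shape, the fact that pre_fetch is called inside the child block does not delay loading; the call only synchronizes with work the background thread started earlier.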
> write_output(activation)
> ```
>
> ### forward: Copy parameter to onto different devices
Is "Copy parameter onto different devices" a performance improvement? I agree that this is a more graceful approach, but won't "Copy parameter onto different devices" run only once, so the performance cost may be negligible?
It looks like the body of this section contains other optimizations besides "Copy parameter onto different devices"; maybe it needs a better title?
Maybe I have this question because I did not fully understand it.
The current implementation of backward only supports updating the gradient in one place, so we need to copy the updated parameters at every iteration.
> }
> ```
>
> ## Proformance Imporvement
Just a minor typo here. Proformance -> Performance
> ```
> In the forward pass
> | Split input onto different devices
> | Copy parameter to onto different devices
You can drop the "to": "Copy parameter onto different devices".