Skip to main content
2 votes
1 answer
55 views

I am using Dask for some processing. The client starts successfully, but I am seeing zero workers. This is how I am creating the client: client = Client("tls://localhost:xxxx") This is the ...
martian muonhunter's user avatar
-3 votes
0 answers
47 views

I’m trying to reason about GPU utilisation and I feel like I’m missing something. If kernels in the default stream run sequentially, then how do we actually fully utilise the GPU? A single kernel ...
Pratisha Bista's user avatar
1 vote
1 answer
148 views

using System.Diagnostics; const int TASKS = 100; var mainSw = Stopwatch.StartNew(); var tasks = Enumerable.Range(0, TASKS).Select(i => Task.Run(async () => { await Task.Delay(...
yuri's user avatar
  • 29
Best practices
0 votes
5 replies
111 views

I need to process a list of objects (not the same shown on the sample), which I thought could be greatly improved by running it in parallel.foreach loop. However, the result is not what I expected. ...
Jlong101's user avatar
Advice
1 vote
0 replies
45 views

I need to trigger a dynamic number of sub-workflows in parallel (around 100) and wait for ALL of them to complete before continuing the main workflow. I’ve implemented a solution but I’m wondering if ...
Michal's user avatar
  • 121
-4 votes
0 answers
44 views

I’ve been reading about Python’s Global Interpreter Lock (GIL), and I’m a bit confused about how it actually works behind the scenes. From what I understand, the GIL allows only one thread to execute ...
Yash Gupta's user avatar
Advice
2 votes
2 replies
63 views

I am working on a global PDE problem that is solved using a standard domain-decomposition strategy (e.g., Scotch, METIS). This part of the computation is well balanced across all MPI processes. ...
hrx71's user avatar
  • 1
Tooling
1 vote
3 replies
77 views

The idea of Persistent-Memory gawk is fabulous because it improves the performance, size, and clarity of many scripts on static and reference data. However, I have a significant problem in adopting ...
Sergio Albert's user avatar
1 vote
0 answers
84 views

I'm trying to run calculations using multiple cores in Python on multiple platforms (Linux, macOS, Windows). I need to pass a large CustomClass Object and a dict (both readonly) to all workers. So far ...
polyte's user avatar
  • 459
0 votes
0 answers
46 views

I am now trying to use FSDP in Huggingface transformers Trainer. The training script is something like train_dataset = Mydataset(...) args = TrainingArguments(...) model = LlamaForCausalLM....
xuehao-049's user avatar
0 votes
0 answers
22 views

I have the following process definition I try to execute on Camunda 7.24 / CibSeven 2.1 which currently logs during execution many OptimisticLockingException. I could already trace it down that it ...
BigMichi1's user avatar
  • 308
0 votes
1 answer
126 views

I have the following C# code : var rand = new Random(1); var range = Enumerable.Range(1, 8); var partition = Partitioner.Create(range, EnumerablePartitionerOptions.NoBuffering); foreach (var x in ...
tigrou's user avatar
  • 4,596
0 votes
1 answer
100 views

This is a bit of a slog so bare with me. I'm currently writing a 3D S(moothed) P(article) H(ydrodynamics) simulation in Unity with a parallel HLSL backend. It's a Lagrangian method of fluid simulation,...
Ben Williams's user avatar
Tooling
0 votes
0 replies
36 views

I am running Flux 1 dev text to image model through ComfyUI in Kaggle. Everything works but I noticed that Kaggle offers a second GPU inside the notebook. If I try to run two instances of the ComfyUI ...
Bram Fran's user avatar
  • 133
1 vote
0 answers
81 views

I am reading a paragraph about the tbb::parallel_scan algorithm from the book Intel Threading Building Blocks, and I understood what the operation does serially, but I am not understanding what are ...
luczzz's user avatar
  • 446

15 30 50 per page
1
2 3 4 5
1464