Skip to main content
-3 votes
0 answers
43 views

We maintain multiple counters where each incoming request increments or decrements one or more counters. These counters are bounded by a max value, once that is reached, we reject the requests. ...
tusharRawat's user avatar
0 votes
0 answers
28 views

MixCoord keeps routing requests to non-existent QueryNodes after a Kubernetes worker node reboot in Milvus I’m running a Milvus 2.5.x cluster on Kubernetes, where each worker node hosts a full set of ...
Schiffer Marget's user avatar
0 votes
0 answers
52 views

I created a 5 GPU Cluster using three nodes/machines locally using the tensorflow.distributed.MultiWorkerMirrored Strategy. One machine has the Apple M1 Pro Metals GPU, the other two nodes has NVIDIA ...
vinhdiesal's user avatar
0 votes
0 answers
44 views

I don't seem to understand how clusters work in the parameterServerStrategy in TensorFlow, and I need some clarifications. I have read this tutorial, but they don't mention or explain clearly how to ...
ali-saaeddin-1123581321's user avatar
0 votes
0 answers
82 views

I implemented an algorithm in a Decentralized Federated Learning (DFL) environment. When I experimented with MNIST and Fashion-MNIST, I achieved an accuracy of 80–90%. However, when testing with CIFAR-...
ddochi's user avatar
  • 37
0 votes
0 answers
461 views

I am completely new to distributed programming and I have been trying to port the original code that ran on a multi-node cluster to single-node cluster with multiple GPUs. My goal is to simulate a ...
soumya_sarkar.19's user avatar
0 votes
0 answers
38 views

We have Kafka as the bus and Akka Cluster Sharding as the application distributed cluster. So we need to consume data from Kafka and process them in Akka Cluster. For a now we implement separate ...
Donz's user avatar
  • 1,397
0 votes
0 answers
92 views

I am new to pytorch-distributed, and any input will help. I have a code working with a single GPU. I am trying to make it distributed. I am getting a socket connect error. Below is the code ( I am ...
GPS-999's user avatar
  • 11
0 votes
1 answer
42 views

I have a number of workstations that run long processes containing sequences like this: x = wait_while_current_is_set y = read_voltage z = z + y The workstations must maintain synchronization with a ...
david's user avatar
  • 2,706
0 votes
0 answers
143 views

Introduction I'm new in PyTorch distributed and multiprocessing and I met the unexpected problems: I have leant that processes created by spawn will execute the given function, but my processes ...
Mecreative's user avatar
2 votes
2 answers
230 views

Imagine a 3 node raft cluster. Each node is in sync has log [1,2,3] and entry 3 is committed by the leader. Now leader receives an entry 4 but fails to commit it because of unreliable network and ...
Dumb_Pegasus's user avatar
1 vote
0 answers
211 views

Has anyone used distributed SQL caching in .Net 4.7.2 ? I have seen many sample code for SQL caching with .Net Core but not with .Net Framework 4.7.2. We are currently using Redis cache in the ...
Monisha's user avatar
  • 11
-1 votes
1 answer
394 views

I've been setting up SeaweedFS on a cluster of three nodes and encountered issues when configuring the S3 gateway. The S3 gateway tries to connect to the incorrect gRPC port 28888 instead of the ...
quarks's user avatar
  • 35.7k
0 votes
1 answer
483 views

I need select data from some distributed and local table, insert into another stand alone local table。I use sql like this: INSERT into local_table SELECT FROM distributed_table WHERE ... . The ...
Sam Wang 's user avatar
1 vote
2 answers
371 views

I am trying to use the Uniswap SDK to get historical rates between two coins on a pool. I believe the rate is simply just xy = k, where k is a constant. If someone buys n coins of x, the cost in terms ...
efwefwefwefwefw wefwefwefwef's user avatar

15 30 50 per page
1
2 3 4 5
149