-
@Susmit07 I'll leave this open just in case some developers see this and can share some insights. In practice, Apache Pekko is a volunteer org with a small active community. We don't really have much bandwidth for answering general questions like this. There are 2 connectors for S3 and HDFS as you describe and they are there for you to try out. If you hit specific issues, please get in touch.
-
@pjfanning we have a requirement to pull a Parquet file whenever it becomes available in a specified HDFS path, and we need to be sure of exactly-once delivery (no data duplication on the sink side). Is this feasible using Pekko connectors?
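One common way to approximate exactly-once on the sink side is at-least-once delivery combined with an idempotent write, so that a retry overwrites the same object instead of creating a second one. The sketch below is a language-neutral illustration in plain Python, not Pekko connector code. `object_key`, `InMemoryStore`, `deliver`, and the `ingest/` prefix are hypothetical names, and the store is an in-memory stand-in that assumes a put to an existing key replaces it.

```python
import hashlib


def object_key(source_path, length, mtime_ns):
    """Derive a stable destination key from the source file's identity."""
    identity = f"{source_path}|{length}|{mtime_ns}".encode()
    digest = hashlib.sha256(identity).hexdigest()
    return f"ingest/{digest[:16]}.parquet"


class InMemoryStore:
    """Hypothetical stand-in for an object store where put overwrites by key."""

    def __init__(self):
        self.objects = {}

    def put(self, key, body):
        self.objects[key] = body


def deliver(store, source_path, length, mtime_ns, body):
    """Write the file under its deterministic key and return that key."""
    key = object_key(source_path, length, mtime_ns)
    store.put(key, body)  # a retry lands on the same key, so no duplicate
    return key
```

Calling `deliver` twice for the same source file leaves a single object in the store. This only helps if the destination key can be derived from the source file alone, which is an assumption about your naming scheme.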
-
"but that is no problem because instead you make sure those retries don't lead to duplication."
That's not feasible in our case. The only approach we can think of is using S3 multipart upload, which is all-or-none.
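The all-or-none idea can be sketched as "stage, then commit in one step". With S3 multipart upload, the object only becomes visible once the upload is completed, and an aborted upload leaves no object behind. The plain-Python sketch below imitates that pattern on a local filesystem, with a staging file standing in for the uploaded parts and `os.replace` standing in for the completion call. `publish_all_or_none` is a hypothetical helper, not part of any connector.

```python
import os
import tempfile


def publish_all_or_none(chunks, final_path):
    """Write chunks to a staging file, then expose it as final_path in one step."""
    directory = os.path.dirname(final_path) or "."
    fd, staging_path = tempfile.mkstemp(dir=directory, suffix=".staging")
    try:
        with os.fdopen(fd, "wb") as staging:
            for chunk in chunks:
                staging.write(chunk)
        # Commit: readers see either no file or the complete file.
        os.replace(staging_path, final_path)
    except BaseException:
        # Abort: discard the partial data so nothing incomplete is published.
        if os.path.exists(staging_path):
            os.remove(staging_path)
        raise
```

If the source fails midway, the destination path is never created. Note that this guards against partial files, not against a second complete copy under a different name, so it is usually paired with a deterministic destination name.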
-
@pjfanning @raboof One more question: suppose we deploy the code in cluster mode and both processes watch the same directories. There can be a race condition where both processes read the same file, so we would need a distributed lock mechanism such as ZooKeeper or Redis. Does Pekko as a framework have any way to lock across cluster nodes? I know there is LWWMap (Last-Write-Wins Map), which is part of Pekko Distributed Data and provides CRDTs (Conflict-Free Replicated Data Types) for managing distributed, replicated state in a cluster, but while doing a POC I found nondeterministic results.
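A last-write-wins map reconciles concurrent writes after the fact rather than rejecting one of them, so two nodes can each believe they own a file until replication catches up, which may explain the nondeterministic results. A claim needs a primitive where exactly one caller succeeds. The plain-Python sketch below shows that shape using an exclusive create on a local filesystem. `try_claim`, `claims_dir`, and the `.claim` suffix are hypothetical, and whether your shared storage offers an equivalent atomic create-if-absent is an assumption to verify. Stale claims from crashed workers are not handled here.

```python
import os


def try_claim(claims_dir, file_name, worker_id):
    """Return True if this worker won the claim for file_name, else False."""
    marker = os.path.join(claims_dir, file_name + ".claim")
    try:
        # O_CREAT | O_EXCL fails if the marker already exists,
        # so only one caller can create it.
        fd = os.open(marker, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    except FileExistsError:
        return False
    with os.fdopen(fd, "w") as handle:
        handle.write(worker_id)  # record the owner for debugging
    return True
```

Each process would call `try_claim` before reading a file and skip the file when it returns `False`.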
-
Hello developer community,
We have a requirement to pull data from HDFS and put it into S3.
We are exploring Pekko connectors. Do you think this is a good use case for them?
The plan is to use the HDFS connector to pull data from HDFS and the S3 connector to push it to an S3 bucket.
Any help would be appreciated.
Thanks.