על אודות
Love development & challenges.
https://github.com/BarSnir
תרומות
-
What are the most effective data collection strategies for real-time data applications in Data Engineering?
Using the right tools: 1. Does the throughput & volume justify big data tools? Distributed systems today are designed to support big data volumes, and they most efficient On a larger scale ( resource balancing, high availability, etc…) If your data comes in small throughput you might don't need to over-engineer your system design. 2. Can you offer your business a nearly real-time solution? If the answer is yes, watch for Spark structured streaming in combination with Apache Kafka. 3. If you need real-time processing and you going to deal with large throughput….consider the combination of Apache Kafka with Apache Flink…the most powerful combination for real-time processing that I have been experienced for so long.
-
What are the key steps for ETL data enrichment and transformation?
1. Plan your partition distribution: Small partitions, with low data volume, can cost more resources to query. Please calculate your partition size and adjust your processor parallelism. 2. Monitor your ETL data quality. How much Null do you receive in your nullable column? If you monitor the process with proper tools (for example Prometheus & Grafna), you might find interesting behavior in your data. 3. Design well operate jobs: For example, design your code with session configuration that can be provided from the job’s arguments (outer arguments). Control your job without code editing. 4. Optimize a number of machine in Dev env: Try to simulate as possible production env, for getting the correct machine specs for the job.
-
How can you use Apache Flink to extract data from cloud-based summarization tools?
Apache Flink is a stream-first processor. 1. Kafka friendly: Including insert or upsert connectors, for Apache Kafka, Flink has a built-in native configuration for Kafka’s producer and consumer. This fact leads to high-performance real-time processing, with Flink table API or SQL. 2. Confluent’s schema registry friendly: Include supporting JSON, Avro, and more, Flink also supports the extraction of Debezium CDC messages by updating the state tables with proper operation (insert, update, delete)… which saves a lot of implementation. Also…the connectors have fully supported communication with the schema registry, even with a secured one. There are more points why it's stream-friendly, check out this tech for streaming processing.
-
How can you optimize data engineering data replication and synchronization latency?
In my experience, when it comes to No SQL data sources, the bottleneck can be in large update volumes. Some points can help us live well with these updates: 1. Alter the document: avoiding partial updates and replacing the entire document in most cases are more acceptable. Updating the partial fields are more heavy process for segments and filesystem. 2. Use defined bulk size: with the proper metrics and the time diff ingest the data in a way the system can handle it, without causing damage to production, such as search or read queries. 3. The magic trick for updating all the documents : create processes & operations that make an up-to-date collection, and make the read services point to the up-to-date collection.
פעילות
-
Hear why Booking.com chose Confluent to scale their data platform:
Hear why Booking.com chose Confluent to scale their data platform:
נוסף לייק על ידי Bar Snir
-
If you'd like to "play with" Confluent Cloud and see how managing Kafka can become much easier... and how it integrates with AWS... Join us at AWS…
If you'd like to "play with" Confluent Cloud and see how managing Kafka can become much easier... and how it integrates with AWS... Join us at AWS…
נוסף לייק על ידי Bar Snir
ניסיון
חינוך
-
John Bryce
-
רישיונות והסמכות
עוד פעילות על ידי Bar
-
בוקר טוב חברים! יש בידי קו״ח של מפתח באקנד מוכשר ומנוסה במיוחד, עם למעלה מ-7 שנות ניסיון בפיתוח מערכות מתקדמות בשפות כמו Go, Node.js מדובר באיש שעבד…
בוקר טוב חברים! יש בידי קו״ח של מפתח באקנד מוכשר ומנוסה במיוחד, עם למעלה מ-7 שנות ניסיון בפיתוח מערכות מתקדמות בשפות כמו Go, Node.js מדובר באיש שעבד…
נוסף לייק על ידי Bar Snir
-
20 Popular Open Source Projects Started or Supported By Big Companies Google - Kubernetes - TensorFlow - Go - Angular Meta - React - PyTorch -…
20 Popular Open Source Projects Started or Supported By Big Companies Google - Kubernetes - TensorFlow - Go - Angular Meta - React - PyTorch -…
נוסף לייק על ידי Bar Snir
-
Join us for CONFLUENT and AWS Immersion Day, to learn how to unleash and drive a business impact with real-time…
Join us for CONFLUENT and AWS Immersion Day, to learn how to unleash and drive a business impact with real-time…
נוסף לייק על ידי Bar Snir
-
Had fun the other day chatting about #datastreaming Thanks Michal Hershkovitz and the rest of #techGym for being such an experienced and supportive…
Had fun the other day chatting about #datastreaming Thanks Michal Hershkovitz and the rest of #techGym for being such an experienced and supportive…
נוסף לייק על ידי Bar Snir
-
In just a couple of weeks, I'm coming once again to #Vilnius! I will be speaking at Big Data Conference Europe . 📣 This time I will be sharing…
In just a couple of weeks, I'm coming once again to #Vilnius! I will be speaking at Big Data Conference Europe . 📣 This time I will be sharing…
נוסף לייק על ידי Bar Snir
-
When asked in my exit interview if this experience was what I had signed up for when I joined Confluent, my answer was simple: no – I got so much…
When asked in my exit interview if this experience was what I had signed up for when I joined Confluent, my answer was simple: no – I got so much…
נוסף לייק על ידי Bar Snir
-
תמונה של בעלי בעזה. לומד למבחן. הוא יצא משם ישר למבחן והיום התברר שגם עבר אותו בהצלחה. לא מגיע לו פרגון? מגיע בהחלט❤️ חנוך דאום
תמונה של בעלי בעזה. לומד למבחן. הוא יצא משם ישר למבחן והיום התברר שגם עבר אותו בהצלחה. לא מגיע לו פרגון? מגיע בהחלט❤️ חנוך דאום
נוסף לייק על ידי Bar Snir
-
מי שהשתמש בGPT או קלוד שיכתוב לו קוד שירים יד 👋 מי שהעתיקה קוד מ stack overflow שתרים יד 🖐️ מי שלקחו קוד מפרוייקט אחר שמצאו ושילבו אותו בשלהם…
מי שהשתמש בGPT או קלוד שיכתוב לו קוד שירים יד 👋 מי שהעתיקה קוד מ stack overflow שתרים יד 🖐️ מי שלקחו קוד מפרוייקט אחר שמצאו ושילבו אותו בשלהם…
נוסף לייק על ידי Bar Snir
-
בפרק החדש של Hello Cloud, שמרית הדברג מ-Google Cloud, מארחת את עומר פורוס, Solution Engineer ב-Confluent, לשיחה מעמיקה על איך DataStreaming# משנה את…
בפרק החדש של Hello Cloud, שמרית הדברג מ-Google Cloud, מארחת את עומר פורוס, Solution Engineer ב-Confluent, לשיחה מעמיקה על איך DataStreaming# משנה את…
נוסף לייק על ידי Bar Snir
-
Being at #FlinkForward this year makes me very sentimental 🥲 #ApacheFlink celebrates its 10-year anniversary. I didn’t expect to see my name in the…
Being at #FlinkForward this year makes me very sentimental 🥲 #ApacheFlink celebrates its 10-year anniversary. I didn’t expect to see my name in the…
נוסף לייק על ידי Bar Snir
-
We just finished a Customer Success Technical Architect team gathering in Madrid, working on our next fiscal year plans. You could feel the energy in…
We just finished a Customer Success Technical Architect team gathering in Madrid, working on our next fiscal year plans. You could feel the energy in…
נוסף לייק על ידי Bar Snir