Ovidiu Petridean

Senior Software Developer @ SDL Research

Kafka in the Big Data ecosystem

This article presents an overview of the concepts that make up the Kafka system and the role they play in the Big Data ecosystem. Big Data is growing in popularity, and so is interest in the technologies in this ecosystem. Analytics is often described as one of the most intricate challenges associated with Big Data. We will not focus on why this challenge arises or how it can be approached because, before analytics can be performed, data has to be integrated and made available to enterprise users. That is where Apache Kafka comes in.

Finding similar entities in BigData models

The purpose of this article is to present a way to find similar entities in Big Data models. We present an algorithm for finding similar sentences in a very large data corpus. Although the examples focus on finding similar sentences, the algorithm can be used to find any kind of entity that can be described by a set of characteristics.

