• Skip to primary navigation
  • Skip to content
  • Skip to primary sidebar
Clean Programmer

Clean Programmer

Programming & DevOps Resources

  • Home
  • Library
  • About
  • Contact

druid.io

How to Configure Logging in Druid

November 24, 2018 Monzurul Haque Shimul

One of the most common problems running Druid with default settings is that it might take gigabytes of spaces for logging. I’ve faced this issue and found a lot of people having the same in Druid community and StackOverflow. This article will demonstrate how to correctly configure logging in Druid to save disk spaces. Druid… Read More

Druid druid.io imply.io

How To Compact Druid Data Segments Using Compaction Task

November 23, 2018 Monzurul Haque Shimul

Druid stores data in segments. Each segment is a single file, typically comprising up to a few million rows of data. Because there is some per-segment memory and processing overhead, it can sometimes be beneficial to reduce the total number of segments. This tutorial demonstrates how to compact existing segments into fewer but larger segments using Druid compaction task.

Read More about How To Compact Druid Data Segments Using Compaction Task

Druid druid.io imply.io kafka

How to Reindex Data in Druid with Native Batch Ingestion

November 22, 2018 Monzurul Haque Shimul

Once you ingest some data in a dataSource for an interval and create Druid segments, you might want to make changes to the ingested data. For example, if you want to add or remove columns from your existing segments, or you want to change the rollup granularity of your segments, you will have to reindex your data. Kafka Indexing Service may produce a number of segments based on topic partition and granularity configurations. So you need to reindex data to reduce the number of segments. All of these can be done by reindexing the data using Hadoop batch ingestion or native batch ingestion. In this article, I will demonstrate how to reindex data in Druid using the native batch ingestion.

Read More about How to Reindex Data in Druid with Native Batch Ingestion

Druid druid.io imply.io kafka

Loading Data From Apache Kafka to Druid

November 21, 2018 Monzurul Haque Shimul

One of the most popular trends in the data world is the stream analytics. Organizations are increasingly striving to build solutions that can provide immediate access to key business intelligence insights through real-time data exploration. Using Apache Kafka and Druid we can easily build an analytics stack that enables immediate exploration and visualization of event data. This tutorial demonstrates how to load data streams from a Kafka topic to Druid, using the Druid Kafka indexing service.

Read More about Loading Data From Apache Kafka to Druid

Apache Kafka Druid druid.io imply.io kafka

Loading data into Druid from TSV file

November 10, 2018 Monzurul Haque Shimul

In this article, I am going to demonstrate how to load data into Druid from TSV file, using Druid’s native batch ingestion using TSV ParseSpec. I assume you already have a good understanding of Druid architecture and have Druid installed and running. If not, see my previous post to quickly install and run Druid. TSV ParseSpec TSV ParseSpec… Read More

Druid druid.io imply.io

  • « Previous Page
  • Page 1
  • Page 2
  • Page 3
  • Page 4
  • Page 5
  • Next Page »

Primary Sidebar

Categories

  • Apache Kafka
  • Druid
  • Git
  • Java
  • Java EE
  • Redis
  • Spring
  • Uncategorized
  • Weblogic
  • Wildfly

Featured Posts

Automate git reposotory migration from local git server to bitbucket – Part 1

Improving the performance of WildFly application server to handle concurrency

Git Alias: Make your Git experience simpler, easier, faster and clean

Loading Json Data From Batch File Into Druid

How to Reindex Data in Druid with Native Batch Ingestion

Tags

bash bitbucket cassandra cloudserver curl docker druid.io eclipselink ejb git imply.io java java-ee jaxws jboss jboss-cli jdbc jdk jms kafka maven minio mssql mysql ojdbc oracle postgresql redis rest rest-template S3 scality sdk sdkman soap spring sqlserver stream stream api weblogic web services wildfly wsdl zenko zookeeper

Archives

  • January 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • August 2018
  • July 2018
  • June 2018
  • May 2018

Copyright © 2019 · CLEAN PROGRAMMER

  • Privacy Policy
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.OKNoRead more
Revoke Cookies