In this article, I will demonstrate how to load data into Druid from a CSV file, using Druid’s native batch ingestion with a CSV ParseSpec. I assume you already have a good understanding of Druid architecture and have Druid installed and running. If not, see my previous post to quickly install and run Druid using…
Druid supports “multi-value” string dimensions. These are generated when an input field contains an array of values instead of a single value. topN and groupBy queries can group on multi-value dimensions. When grouping on a multi-value dimension, all values from matching rows will be used to generate one group per value. It’s possible for a query to return…
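To make the "one group per value" behaviour concrete, here is a minimal plain-Java sketch that simulates it in memory. It is not Druid code and does not use any Druid API: the class name `MultiValueGrouping`, the method `countByValue`, and the sample tag values are all hypothetical, chosen only for illustration.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

class MultiValueGrouping {
    // Each inner list stands in for one row's multi-value dimension (hypothetical data model).
    // Every value of every row contributes to its own group, so a row with
    // two values is counted in two different groups.
    static Map<String, Long> countByValue(List<List<String>> rows) {
        return rows.stream()
                .flatMap(List::stream)              // expand each row into its individual values
                .collect(Collectors.groupingBy(
                        value -> value,             // one group per distinct value
                        TreeMap::new,               // sorted keys for stable output
                        Collectors.counting()));
    }

    public static void main(String[] args) {
        List<List<String>> rows = List.of(
                List.of("a", "b"),
                List.of("b"),
                List.of("a", "c"));
        countByValue(rows).forEach((value, count) ->
                System.out.println(value + " -> " + count));
    }
}
```

Note that the three sample rows produce group counts that add up to five, because rows holding several values are counted once per value.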
In my previous article, I demonstrated how to perform a batch file load using Druid’s native batch ingestion. I showed only how to handle root-level JSON elements and intentionally skipped nested elements. That’s because nested JSON needs special handling for ingestion into Druid; they need to…
Consider time-series event data stored in a relational database, one event per row. To calculate the number of events per hour, we’d select all rows within an overall interval, group those rows by hour, and count the rows in each hour group. If we have to perform this query many…
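The select, group, and count steps described here can be sketched in plain Java. This is only an in-memory illustration under my own assumptions, not a database query or Druid code: the class name `HourlyEventCounts`, the method `countPerHour`, the half-open interval convention, and the sample timestamps are all hypothetical.

```java
import java.time.Instant;
import java.time.temporal.ChronoUnit;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

class HourlyEventCounts {
    // Counts events per hour for timestamps in the half-open interval [start, end).
    static Map<Instant, Long> countPerHour(List<Instant> events, Instant start, Instant end) {
        return events.stream()
                // 1. select all rows within the overall interval
                .filter(t -> !t.isBefore(start) && t.isBefore(end))
                // 2. group by hour, 3. count the rows in each hour group
                .collect(Collectors.groupingBy(
                        t -> t.truncatedTo(ChronoUnit.HOURS),
                        TreeMap::new,               // keep hours in chronological order
                        Collectors.counting()));
    }

    public static void main(String[] args) {
        List<Instant> events = List.of(
                Instant.parse("2020-01-01T00:05:00Z"),
                Instant.parse("2020-01-01T00:40:00Z"),
                Instant.parse("2020-01-01T01:10:00Z"),
                Instant.parse("2020-01-02T00:00:00Z")); // falls outside the interval
        Map<Instant, Long> counts = countPerHour(events,
                Instant.parse("2020-01-01T00:00:00Z"),
                Instant.parse("2020-01-02T00:00:00Z"));
        counts.forEach((hour, n) -> System.out.println(hour + " -> " + n));
    }
}
```

Every call rescans all events in the interval, which is the cost that adds up when the same query is run repeatedly.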
Many Java applications have dependencies that aren’t available in a public or private Maven repository. Sometimes you just have a third-party Java library, packaged as a JAR, that you need to integrate into your application. This is very common when you use Oracle as your database and need to connect to it using ojdbc.jar, which is not available in the Maven Central repository. You need to download it from the Oracle website and then integrate it into the Maven project as a third-party dependency.
In this post, I will show how to add a third-party client JAR to a Maven project.