PostgreSQL Conference Europe 2018 - Speaker Interview - Amit Kapila

Speaker Interview: Amit Kapila

Zheap: An answer to PostgreSQL bloat woes Wednesday 16:20 New York

Parallel query in PostgreSQL: How not to (mis)use it? Friday 09:30 New York

Twitter: @kapila_amit Blog: amitkapila16.blogspot.com LinkedIn: amit-kapila Company website: enterprisedb.com

Could you briefly introduce yourself?

I am working in databases from past 17 years. I am working on Postgres from past 10 years and got actively involved in PostgreSQL community in 2012. I am PostgreSQL committer and major contributor.

How do you engage with the PostgreSQL Community?

I actively participates in the development of PostgreSQL, both developing and reviewing new features. The major work done by me in PostgreSQL includes parallel-query, performance, scalability and durable hash-indexes.

From past 1.5 years or so, I am working on a community project zheap which will provide better control over bloat, reduces database size and reduces write amplification.

Have you enjoyed previous pgconf.eu or FOSDEM conferences, either as attendee or as speaker?

Yes, I have been previously attended pgconf.eu at Tallin as a speaker.

What will your talk be about, exactly? Why this topic?

The talk is about bloat that can get accumulated overtime in postgres and how the new solution (zheap) can help to avoid it. This is one of the common problem that many of the enterprises complain about and at EnterpriseDB, we (me and some of my colleagues) are working from more than a 1.5 years on the new storage format in which only the latest version of the data is kept in main storage and the old versions are moved to an undo log.

I am also a co-speaker in one of the other talks which will be mostly about the usage of parallel-query. We will talk about the cases where user can expect to see benefit and also the cases where user shouldn’t expect benefit. I am working along with my other colleagues in this area since PostgreSQL 9.6.

What is the audience for your talk?

The people who cares about the performance of write workloads and the size of their database or any technology enthusiast.

What existing knowledge should the attendee have?

One who is working with PostgreSQL database and has either heard of or is concerned by the increase in size of their databases over period of time.

What is the one feature in PostgreSQL 11 which you like most?

Improvements to parallelism.

Which other talk at this year’s conference would you like to see?

I would like to attend Andres’s Pluggable Storage and Peter Geoghegan’s Bloat In PostgreSQL as both are in some way related to my ongoing project zheap.