• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

apache / carbondata / 1230
76%

Build:
DEFAULT BRANCH: master
Ran 01 Nov 2018 10:07AM UTC
Jobs 1
Files 1026
Run time 10min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

pending completion
1230

push

jenkins

Jacky Li
[CARBONDATA-3052] Improve drop table performance by reducing the namenode RPC calls during physical deletion of files

Problem
Current drop table command takes more than 1 minute to delete 3000 files during drop table operation from HDFS

Analysis
Even though we are using HDFS file system we are explicitly we are recursively iterating through the table folders and deleting each file. For each file deletion and file listing one rpc call is made to namenode. To delete 3000 files 3000 rpc calls are made to namenode for file deletion and few more rpc calls for file listing in each folder.

Solution
HDFS provides an API for deleting all folders and files recursively for a given path in a single RPC call. Use that API and improve the drop table operation performance.

Result: After these code changes drop table operation time to delete 3000 files from HDFS has reduced from 1 minute to ~2 sec.

This closes #2868

61811 of 77152 relevant lines covered (80.12%)

1.06 hits per line

Jobs
ID Job ID Ran Files Coverage
1 1230.1 01 Nov 2018 10:07AM UTC 0
80.12
Source Files on build 1230
Detailed source file information is not available for this build.
  • Back to Repo
  • Jenkins Build #1230
  • 82eec10c on github
  • Prev Build on master (#1231)
  • Next Build on master (#1232)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc