
Drop table failed when metadata.json file is missing #12062

Open

SGITLOGIN opened this issue Jan 23, 2025 · 3 comments
Labels
improvement PR that improves existing functionality

Comments


SGITLOGIN commented Jan 23, 2025

Feature Request / Improvement

Spark configuration

spark.sql.catalog.spark_catalog = org.apache.iceberg.spark.SparkSessionCatalog
spark.sql.catalog.spark_catalog.type = hive
spark.sql.extensions=org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions

Question:

Drop table fails when the metadata.json file is missing.

When the metadata.json file is lost, I want to delete the table's record from the Hive metastore. How can this situation be resolved?

Query engine

Spark

Willingness to contribute

  • I can contribute this improvement/feature independently
  • I would be willing to contribute this improvement/feature with guidance from the Iceberg community
  • I cannot contribute this improvement/feature at this time
SGITLOGIN added the improvement label on Jan 23, 2025
RussellSpitzer (Member) commented:

The difficulty in Spark is that Spark will always call "load" before "drop".

So essentially DROP in Spark looks like:

Load Catalog
Load Table
Call Table.Drop

This makes it difficult to implement a "force drop", because the call to Table.Drop happens after we hit the error in LoadTable. We've had several threads and issues about this previously; maybe the best thing to do is add a Java utility method which just calls the underlying catalog's drop function on the Spark table instance without going through the load pathway?
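Such a utility could be sketched as below. This is only an illustration of the pattern, not Iceberg's actual API: the `Catalog` interface and map-backed metastore here are hypothetical stand-ins (the real interface is `org.apache.iceberg.catalog.Catalog#dropTable(TableIdentifier, boolean)`). The point is simply that the drop call never reads the broken metadata.json, so it cannot fail the way load does.

```java
import java.util.HashMap;
import java.util.Map;

public class ForceDropSketch {
    // Hypothetical stand-in for the catalog interface; the real Iceberg API
    // takes a TableIdentifier and a purge flag.
    interface Catalog {
        boolean dropTable(String db, String table, boolean purge);
    }

    // Builds a catalog over a fake in-memory metastore. Dropping removes the
    // entry directly, without ever reading the metadata.json it points to.
    static Catalog inMemoryCatalog(Map<String, String> metastore) {
        return (db, table, purge) -> metastore.remove(db + "." + table) != null;
    }

    public static void main(String[] args) {
        Map<String, String> metastore = new HashMap<>();
        // Entry whose metadata file no longer exists on storage: a normal
        // load-then-drop would fail at the load step.
        metastore.put("db.orders", "s3://bucket/db/orders/metadata/00001.metadata.json");

        Catalog catalog = inMemoryCatalog(metastore);

        // "Force drop": call the catalog's drop directly, skipping load.
        boolean dropped = catalog.dropTable("db", "orders", false);
        System.out.println(dropped);             // true: entry removed
        System.out.println(metastore.isEmpty()); // true: metastore is clean
    }
}
```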

For Hive catalogs in Spark we can already work around the Spark interface by calling drop directly on the Hive catalog. You can do this with:

spark.sharedState.externalCatalog.dropTable("db", "table", false, false)

which uses Spark's Hive client to call drop directly on the catalog without going through the Spark machinery.

If you have a better solution for fixing SparkCatalog.java, please let me know.

ebyhr (Contributor) commented Jan 24, 2025:

How about adding a new procedure that drops a table without "load" and without deleting files?
The Trino Iceberg connector has an unregister_table procedure, as you may already know:

CALL example.system.unregister_table(schema_name => 'testdb', table_name => 'customer_orders')

SGITLOGIN (Author) commented:

OK, I will test this method.
