Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SIGSEGV in Tree.cellDataProc when calling TreeItem.setImage #678

Open
torokati44 opened this issue May 18, 2023 · 46 comments · May be fixed by #1612 or #1712
Open

SIGSEGV in Tree.cellDataProc when calling TreeItem.setImage #678

torokati44 opened this issue May 18, 2023 · 46 comments · May be fixed by #1612 or #1712

Comments

@torokati44
Copy link

torokati44 commented May 18, 2023

Describe the bug

Virtual Tree widget crashes JVM (through GTK3) when setting Image onto TreeItem in Listener to SWT.SetData.
This is a continuation of https://bugs.eclipse.org/bugs/show_bug.cgi?id=573090.

---------------  T H R E A D  ---------------

Current thread (0x00007ffa500284a0):  JavaThread "main" [_thread_in_native, id=112372, stack(0x00007ffa57900000,0x00007ffa57a00000)]

Stack: [0x00007ffa57900000,0x00007ffa57a00000],  sp=0x00007ffa579fc318,  free space=1008k
Native frames: (J=compiled Java code, j=interpreted, Vv=VM code, C=native code)
C  [libgobject-2.0.so.0+0x3b251]  g_type_check_instance_is_fundamentally_a+0x11
C  [libswt-pi3-gtk-4958r2.so+0x4b609]  Java_org_eclipse_swt_internal_gtk_OS_g_1object_1set__J_3BJJ+0x4a

Java frames: (J=compiled Java code, j=interpreted, Vv=VM code)
J 11988  org.eclipse.swt.internal.gtk.OS.g_object_set(J[BJJ)V (0 bytes) @ 0x00007ffa491c5883 [0x00007ffa491c5820+0x0000000000000063]
J 10921 c1 org.eclipse.swt.widgets.Tree.cellDataProc(JJJJJ)J (486 bytes) @ 0x00007ffa419b8bf4 [0x00007ffa419b81e0+0x0000000000000a14]
J 10920 c1 org.eclipse.swt.widgets.Display.cellDataProc(JJJJJ)J (29 bytes) @ 0x00007ffa41f4ac5c [0x00007ffa41f4aa60+0x00000000000001fc]
v  ~StubRoutines::call_stub
J 11619  org.eclipse.swt.internal.gtk3.GTK3.gtk_main_iteration_do(Z)Z (0 bytes) @ 0x00007ffa49380e69 [0x00007ffa49380e20+0x0000000000000049]
J 11623 c1 org.eclipse.swt.widgets.Display.readAndDispatch()Z (88 bytes) @ 0x00007ffa423e622c [0x00007ffa423e6060+0x00000000000001cc]

To Reproduce

package swt_tree_crash;

import org.eclipse.swt.SWT;
import org.eclipse.swt.graphics.Image;
import org.eclipse.swt.layout.FillLayout;
import org.eclipse.swt.widgets.Display;
import org.eclipse.swt.widgets.Event;
import org.eclipse.swt.widgets.Listener;
import org.eclipse.swt.widgets.Shell;
import org.eclipse.swt.widgets.Tree;
import org.eclipse.swt.widgets.TreeItem;

public class Main {
    public static void main(String[] args) {
        Display display = new Display();
        Shell shell = new Shell(display);
        shell.setLayout(new FillLayout());

        Tree tree = new Tree(shell, SWT.VIRTUAL);
        tree.addListener(SWT.SetData, new Listener() {
            public void handleEvent(final Event e) {
            	TreeItem item = (TreeItem)e.item;
                item.setText(0, "A");
                item.setImage(new Image(display, 20, 20)); // <-- this is the critical line!
            }
        });
        tree.setItemCount(1);
        
        shell.setSize(400, 300);
        shell.open();

        while (!shell.isDisposed()) {
            if (!display.readAndDispatch()) {
                display.sleep();
            }
        }
        display.dispose();
    }
}

Expected behavior

The application always shows a window with a tree widget in it, containing one item, with a small white icon.

Screenshots

Not applicable.

Environment:

  1. Select the platform(s) on which the behavior is seen:
    • All OS
    • Windows
    • Linux
    • macOS
  1. Additional OS info (e.g. OS version, Linux Desktop, etc)

Operating System: Fedora Linux 38
KDE Plasma Version: 5.27.4
KDE Frameworks Version: 5.105.0
Qt Version: 5.15.9
Kernel Version: 6.2.15-300.fc38.x86_64 (64-bit)
Graphics Platform: Wayland
Processors: 16 × AMD Ryzen 7 3800X 8-Core Processor
Memory: 31.2 GiB of RAM
Graphics Processor: AMD Radeon RX 550 Series

org.eclipse.swt.internal.gtk.version=3.24.37

  1. JRE/JDK version

JRE version: OpenJDK Runtime Environment Temurin-17.0.7+7 (17.0.7+7) (build 17.0.7+7)
Java VM: OpenJDK 64-Bit Server VM Temurin-17.0.7+7 (17.0.7+7, mixed mode, tiered, compressed oops, compressed class ptrs, g1 gc, linux-amd64)

Version since

Tested on Eclipse 4.27 (SWT 4.958).

Workaround (or) Additional context

The crash doesn't always happen, but fairly often.
Removing the TreeItem.setImage call avoids the crash entirely.

@torokati44 torokati44 changed the title JVM crash in Tree.cellDataProc when calling TreeItem.setImage SIGSEGV Tree.cellDataProc when calling TreeItem.setImage May 18, 2023
@torokati44 torokati44 changed the title SIGSEGV Tree.cellDataProc when calling TreeItem.setImage SIGSEGV in Tree.cellDataProc when calling TreeItem.setImage May 18, 2023
@SyntevoAlex
Copy link
Member

Sorry about the waiting time, SWT currently has few active maintainers.
I'm generally interested in JVM crashes, and plan to have a look at this eventually. Unfortunately the waiting time could be around a month.

@torokati44
Copy link
Author

Hello, just a ping on this issue: We are still often encountering this crash on an almost daily basis... :/

@SyntevoAlex
Copy link
Member

SWT is currently low on active contributors.
It would be nice if you can debug the problem yourself and prepare a fix.
Otherwise, alas, you'll have to wait more, and there's no ETA.

@torokati44
Copy link
Author

torokati44 commented Aug 30, 2023

Otherwise, alas, you'll have to wait more, and there's no ETA.

Of course, I understand, FOSS is what it is...

It would be nice if you can debug the problem yourself and prepare a fix.

Actually we ended up tossing the problematic pieces of code (in this one and #697) into a Display.asyncExec callback, so they can be done at a time when Gtk is hopefully no longer confused. It's dumb, but appears to be working so far.

@Mailaender
Copy link
Contributor

I found a way to reproduce all the time in a @flatpak runtime, where also all workarounds fail.

@SyntevoAlex
Copy link
Member

It would be useful if you describe how exactly do you reproduce.

@Mailaender
Copy link
Contributor

Sorry, I thought that it won't be that helpful because you need to build a package to test.

Here it is https://github.com/Mailaender/flathub/compare/chemclipse Check the README for the commands to build it.

Click Demo on the green intro screen to trigger the crash.

Use

runtime-version: 20.08

and rebuild to see how the application looked before the crash was introduced.

@SyntevoAlex
Copy link
Member

Oh, I didn't expect a whole other application. Indeed I don't have the time to debug it.
Can you please post your hs_err_pid.log crash reports?

@Mailaender
Copy link
Contributor

@Mailaender
Copy link
Contributor

You can also indirectly trigger this with

public class MyLabelProvider extends LabelProvider implements ILabelProvider {
	
	@Override
	public Image getImage(Object element) {
		// return something other then null
	}
}

This is sometimes difficult to reproduce. I had my best chances when there was only one unexpanded item in the tree viewer.

@Mailaender
Copy link
Contributor

Mailaender commented Jan 30, 2024

I can't reproduce this in a development environment so this might be already accidentally fixed?

grafik

@torokati44
Copy link
Author

We still have the Display.asyncExec workaround in place, so I can't comment on it, as we haven't encountered it since.
It's possible it got fixed, but I can't verify it due to lack of time, sorry.

@Mailaender
Copy link
Contributor

There are occasions where the Display.asyncExec either does not work or only lessens the problem.

I just copied your snippet into my 2023-03 project and it crashed. It does not when I try it here.

The instructions at https://www.eclipse.org/swt/git.php are somehow outdated or not very precise, but this should work:

Install git-lfs. Git clone this repository. Only import the repositories (plus snippets) and rename or copy

I just ignored the API baseline error and got a test environment where the problem is not reproducible or hopefully already fixed.

@Mailaender
Copy link
Contributor

Updated consecutively up to Eclipse 2023-12 and the crash is still there.

@Mailaender
Copy link
Contributor

While trying to work around it, I found another way to crash more reliable:

public class MyLabelProvider extends LabelProvider implements ILabelProvider {

	@Override
	public Image getImage(Object element) {
		try {
			Thread.sleep(100);
		} catch(InterruptedException e) {
			logger.warn(e);
		}
		new Image(display, 16, 16))
	}
}

@Mailaender
Copy link
Contributor

Mailaender commented Mar 19, 2024

I have no real fix yet, and the TreeViewer API does not allow for a simple override of the setImage function. However, I found out that only the first setImage will crash, all subsequent won't. So setImage(null) will make all following calls safe.

@Mailaender
Copy link
Contributor

I also don't understand why this crash does not happen with the Eclipse IDE. The PDE Project Explorer and Java Package Explorer as well as the Git Repositories all use trees with icons.

@basilevs
Copy link
Contributor

@SyntevoAlex could you please suggest a way to detect GTK warnings from Junit tests?

GTK has been desperately trying to warn us about this problem, but we always ignore its printout to standard error:

My Application Name:4909): GLib-CRITICAL **: 14:43:25.864: g_hash_table_foreach: assertion 'version == hash_table->version' failed

(My Application Name:4909): GLib-GObject-WARNING **: 14:43:25.897: g_object_notify_queue_thaw: property-changed notification for GtkCellRendererText(0x7f06dded6990) is not frozen

@ericwill
Copy link
Contributor

@ericwill why does TreeItem.setImage() recreate renderer for a single column? Should not all columns be affected?

You're asking me about code I haven't touched in ~4 years, so I might be wrong. If memory serves correctly the method in question modifies only one column at a time as per SWT API. Also the method in question involves SWT.VIRTUAL trees which are performance sensitive (lazy loaded) so we are not looking to make broad changes as it could incur a performance hit.

@basilevs
Copy link
Contributor

You're asking me about code I haven't touched in ~4 years, so I might be wrong. If memory serves correctly the method in question modifies only one column at a time as per SWT API. Also the method in question involves SWT.VIRTUAL trees which are performance sensitive (lazy loaded) so we are not looking to make broad changes as it could incur a performance hit.

It was trying to solve cropping of images by using actual size in gtk_cell_renderer_set_fixed_size(renderer, x, y) call, however, it uses Tree-global values, so was worried about other columns, where this call does not happen and leaves renderer configuration unsynchronized with global setting, potentially leaving invalid cropping for all but one column.

This is fine, neither of us probably have time to write a test anyway 🤷

@basilevs
Copy link
Contributor

The enhaced reproducer with a fixed SWT version can now be found in https://github.com/basilevs/bug_678

basilevs added a commit to basilevs/eclipse.platform.swt that referenced this issue Nov 25, 2024
GTK 3.24.41 (Ubuntu 24) crashes when renderers are replaced during
render.
If replacement is postponed, the crash no longer happens.

Fixes eclipse-platform#678.
basilevs added a commit to basilevs/eclipse.platform.swt that referenced this issue Nov 25, 2024
GTK 3.24.41 (Ubuntu 24) crashes when renderers are replaced during
render.
If replacement is postponed, the crash no longer happens.

Fixes eclipse-platform#678.
basilevs added a commit to basilevs/eclipse.platform.swt that referenced this issue Nov 25, 2024
GTK 3.24.41 (Ubuntu 24) crashes when renderers are replaced during
render.
If replacement is postponed, the crash no longer happens.

Fixes eclipse-platform#678.
@om-11-2024
Copy link

om-11-2024 commented Nov 29, 2024

Thread.sleep() was mentioned above

It tried adding Thread.sleep() to the first code snippet for reproducing the bug, and now the bug to appears at every run of the snippet on my machine.

package swt_tree_crash;

import org.eclipse.swt.SWT;
import org.eclipse.swt.graphics.Image;
import org.eclipse.swt.layout.FillLayout;
import org.eclipse.swt.widgets.Display;
import org.eclipse.swt.widgets.Event;
import org.eclipse.swt.widgets.Listener;
import org.eclipse.swt.widgets.Shell;
import org.eclipse.swt.widgets.Tree;
import org.eclipse.swt.widgets.TreeItem;

public class Main {
    public static void main(String[] args) {
        Display display = new Display();
        Shell shell = new Shell(display);
        shell.setSize(400, 300);
        shell.setLayout(new FillLayout());

        Tree tree = new Tree(shell, SWT.VIRTUAL);
        tree.addListener(SWT.SetData, new Listener() {
            public void handleEvent(final Event e) {
            	TreeItem item = (TreeItem)e.item;
                item.setText(0, "A");
                try {
                  Thread.sleep(1000);
                }
                catch ( InterruptedException ex ) {
                  throw new RuntimeException(ex);
                }
                item.setImage(new Image(display, 20, 20)); // <-- this is the critical line!
            }
        });
        tree.setItemCount(1);
        
        shell.open();

        while (!shell.isDisposed()) {
            if (!display.readAndDispatch()) {
                display.sleep();
            }
        }
        display.dispose();
    }
}

@basilevs
Copy link
Contributor

@om-11-2024 does wrapping setImage() in Display.asyncExec() fix your snippet?

@om-11-2024
Copy link

Yes, this fixes the bug in the snippet.

@om-11-2024
Copy link

Here is what causes the error in the swt code.

The problem is inside org.eclipse.swt.widgets.Tree.cellDataProc().

Here is the stacktrace to Tree.cellDataProc() in this snippet:

Tree.cellDataProc(long, long, long, long, long)
Display.cellDataProc(long, long, long, long, long) line: 995	
OS.swt_fixed_resize(long, long, int, int) line: not available [native method]	
Tree(Scrollable).resizeHandle(int, int) line: 554	
Tree(Composite).resizeHandle(int, int) line: 1600	
Tree(Control).setBounds(int, int, int, int, boolean, boolean) line: 1122	
Tree(Composite).setBounds(int, int, int, int, boolean, boolean) line: 1658	
Tree.setBounds(int, int, int, int, boolean, boolean) line: 3641	
Tree(Control).setBounds(int, int, int, int) line: 1019	
FillLayout.layout(Composite, boolean) line: 216	
Shell(Composite).updateLayout(boolean) line: 1877	
Shell.setVisible(boolean) line: 3011	
Shell.open() line: 2032	
Main.main(String[]) line: 36

Here is the problem in Tree.cellDataProc() code:

@Override
long cellDataProc (long tree_column, long cell, long tree_model, long iter, long data) {
	...
	boolean setData = false;
	...
	if ((style & SWT.VIRTUAL) != 0) {
		...
		setData = checkData (item); // <= HERE new renderers are created, and the current renderer is disposed + it's memory is freed
		...
	}
	...
	long [] ptr = new long [1];
	if (setData) {
			...
			GTK.gtk_tree_model_get (tree_model, iter, modelIndex + CELL_TEXT, ptr, -1);
			if (ptr [0] != 0) {
				OS.g_object_set (cell, OS.text, ptr[0], 0); // <= HERE is an attempt to set property for the already disposed current renderer
				OS.g_free (ptr[0]);
			}
		}
	}

The problem is that:

  1. at first the current renderer (it's address passed to the method in the cell argument) is disposed inside checkData (item)
  2. later there is an attempt to set a property to the already disposed current renderer

Here is how the current renderer is released inside Tree.checkData():

  1. SWT.SetData is triggered:
    boolean checkData (TreeItem item) {
    	...
    	if ((style & SWT.VIRTUAL) != 0) {
    		...
    		sendEvent (SWT.SetData, event);
    		...
    	}
    	return true;
  2. our handler in the snippet invokes TreeItem.setImage()
  3. inside TreeItem.setImage() there is an invocation of Tree.createRenderers()
    public void setImage(int index, Image image) {
    	...
    	if (!parent.pixbufSizeSet) {
    		if (image != null) {
    			...
    				if ((parent.style & SWT.VIRTUAL) != 0) {
    					...
    					parent.createRenderers(column, modelIndex, check, parent.style);
    				}
    	...
  4. Tree.createRenderers() calls GTK.gtk_tree_view_column_clear (columnHandle), which calls g_object_unref() on the renderer, which results in the renderer disposal + frees it's memory.

@om-11-2024
Copy link

Because cell is used as the 1st argument in OS.g_object_set (cell, OS.text, ptr[0], 0).

Initially at the beginning of Tree.cellDataProc() the cell contains the pointer to the renderer.

Later in Tree.cellDataProc() there is a call checkData (item), which calls GTK.gtk_tree_view_column_clear (columnHandle) which disposes all renderers for the column (including the renderer which cell points to).

By the time we get to OS.g_object_set (cell, OS.text, ptr[0], 0); the renderer that cell points to is already disposed.

@basilevs
Copy link
Contributor

basilevs commented Dec 2, 2024

@om-11-2024 I assumed renderers are reference-counted, so cell object should hold its own reference to renderer, preventing the disposal. Could you send a link to GTK code managing cell object?

@om-11-2024
Copy link

om-11-2024 commented Dec 2, 2024

Sure, here are some links:

And here is native stacktrace for GTK.gtk_tree_view_column_clear() which leads to g_object_unref() call for the renderer.

g_object_unref() at gobject.c:4 381 0x731907fb46d0	
cell_info_free() at gtkcellareabox.c:412 0x731905725e8c	
gtk_cell_area_box_remove() at gtkcellareabox.c:1 121 0x7319057277bb	
gtk_cell_area_remove() at gtkcellarea.c:1 671 0x73190571e8d4	
gtk_cell_area_clear() at gtkcellarea.c:1 502 0x73190571e263	
gtk_cell_layout_clear() at gtkcelllayout.c:415 0x73190572e1eb	
gtk_cell_layout_default_clear() at gtkcelllayout.c:236 0x73190572d9d8	
gtk_cell_layout_clear() at gtkcelllayout.c:415 0x73190572e1eb	
gtk_tree_view_column_clear() at gtktreeviewcolumn.c:1 693 0x731905ac6f2e	
Java_org_eclipse_swt_internal_gtk_GTK_gtk_1tree_1view_1column_1clear() at 0x731906047569	

This stacktrace is also for this snippet

Inside this g_object_unref() the number of references for the renderer reached 0 and it was disposed.

So yes, the renderer is reference-counted, its owner is column.
When GTK.gtk_tree_view_column_clear() is called, then the column executes g_object_unref() for its renderers (because it no longer needs them), their ref-counters reache 0 and the are disposed.

@basilevs
Copy link
Contributor

basilevs commented Dec 2, 2024

@om-11-2024 do you think Tree.cellDataProc is called from apply_cell_attributes?
Then reserving a reference inside our code would not help, as g_object_thaw_notify still accesses the renderer later.
So #1612 may still be a way to go, unless we want to rework the ownership of renderers.

@om-11-2024
Copy link

om-11-2024 commented Dec 3, 2024

@om-11-2024 do you think Tree.cellDataProc is called from apply_cell_attributes?

It is. Here is the stacktrace with both java and c frames:

// java
  Tree.cellDataProc(long, long, long, long, long) line: 277	
  Display.cellDataProc(long, long, long, long, long) line: 995	

// native
  apply_cell_attributes() at gtkcellarea.c:1 257 0x7679b4b1da3b	
  g_hash_table_foreach() at ghash.c:2 117 0x7679b5ed0e8b	
  gtk_cell_area_real_apply_attributes() at gtkcellarea.c:1 286 0x7679b4b1db55	
  gtk_cell_area_box_apply_attributes() at gtkcellareabox.c:1 310 0x7679b4b27dc5	
  _gtk_marshal_VOID__OBJECT_BOXED_BOOLEAN_BOOLEANv() at gtkmarshalers.c:5 447 0x7679b4aa82e2	
  g_type_class_meta_marshalv() at gclosure.c:1 062 0x7679e0042f7e	
  _g_closure_invoke_va() at gclosure.c:897 0x7679e0042a82	
  signal_emit_valist_unlocked() at gsignal.c:3 424 0x7679e006581f	
  g_signal_emit_valist() at gsignal.c:3 263 0x7679e006516e	
  g_signal_emit() at gsignal.c:3 583 0x7679e0066b43	
  gtk_cell_area_apply_attributes() at gtkcellarea.c:2 373 0x7679b4b20b52	
  gtk_tree_view_column_cell_set_cell_data() at gtktreeviewcolumn.c:2 821 0x7679b4ec9cfc	
  validate_row() at gtktreeview.c:6 432 0x7679b4eaa845	
  initialize_fixed_height_mode() at gtktreeview.c:6 884 0x7679b4eabf24	
  do_validate_rows() at gtktreeview.c:6 931 0x7679b4eac0ac	
  gtk_tree_view_get_preferred_width() at gtktreeview.c:2 683 0x7679b4e9fc39	
  gtk_widget_query_size_for_orientation() at gtksizerequest.c:181 0x7679b4dccdac	
  gtk_widget_compute_size_for_orientation() at gtksizerequest.c:399 0x7679b4dcd6e6	
  gtk_widget_get_preferred_width() at gtksizerequest.c:492 0x7679b4dcd978	
  gtk_scrolled_window_measure() at gtkscrolledwindow.c:1 849 0x7679b4da5819	
  gtk_css_custom_gadget_get_preferred_size() at gtkcsscustomgadget.c:124 0x7679b4b7d79c	
  gtk_css_gadget_get_preferred_size() at gtkcssgadget.c:683 0x7679b4b84da9	
  gtk_scrolled_window_get_preferred_width() at gtkscrolledwindow.c:4 193 0x7679b4dac6da	
  gtk_widget_query_size_for_orientation() at gtksizerequest.c:181 0x7679b4dccdac	
  gtk_widget_compute_size_for_orientation() at gtksizerequest.c:399 0x7679b4dcd6e6	
  gtk_widget_get_preferred_width_for_height() at gtksizerequest.c:565 0x7679b4dcdbb6	
  _gtk_widget_get_preferred_size_and_baseline() at gtksizerequest.c:708 0x7679b4dcdfe0	
  gtk_widget_get_preferred_size() at gtksizerequest.c:750 0x7679b4dce059	
  0x7679b5452269	// this is `swt_fixed_size_allocate()` from swt
  gtk_widget_size_allocate_with_baseline() at gtkwidget.c:6 179 0x7679b4ed8cfb	
  gtk_widget_size_allocate() at gtkwidget.c:6 260 0x7679b4ed9081	
  swt_fixed_resize() at 0x7679b54527d2	
  Java_org_eclipse_swt_internal_gtk_OS_swt_1fixed_1resize() at 0x7679b544e76c	

// java
OS.swt_fixed_resize(long, long, int, int) line: not available [native method]	
  Tree(Scrollable).resizeHandle(int, int) line: 554	
  Tree(Composite).resizeHandle(int, int) line: 1600	
  Tree(Control).setBounds(int, int, int, int, boolean, boolean) line: 1122	
  Tree(Composite).setBounds(int, int, int, int, boolean, boolean) line: 1658	
  Tree.setBounds(int, int, int, int, boolean, boolean) line: 3641	
  Tree(Control).setBounds(int, int, int, int) line: 1019	
  FillLayout.layout(Composite, boolean) line: 216	
  Shell(Composite).updateLayout(boolean) line: 1877	
  Shell.setVisible(boolean) line: 3011	
  Shell.open() line: 2032	
  Main.main(String[]) line: 36	

Then reserving a reference inside our code would not help, as g_object_thaw_notify still accesses the renderer later.

Agree.

So #1612 may still be a way to go, unless we want to rework the ownership of renderers.

Maybe.
For now assessing #1612 it's hard to me.
The main problem is that this bug is caused by bugfix for 513761480261.
That bugfix added Tree.createRenderers() inside TreeItem.setImage(), which was a mistake because:

  • TreeItem.setImage() and TreeItem.setText() are used in SWT.SetData event handlers by design
  • SWT.SetData is used in Tree.cellDataProc() (it's a cell data function - i.e. a function which gets invoked on each cell right before its rendering and which sets what to render: text, image, color, etc.) with SWT.VIRTUAL also by design.

I'm not a fan of the fact that #1612 adds asynchronicity to TreeItem.setImage() - imo this makes the code even more complicated. On the other hand, #1612 works and is a better bugfix for 513761 than what we have now.

@basilevs
Copy link
Contributor

basilevs commented Dec 3, 2024

Indexing options for further discussion:

  1. async recreation of renderers
  2. move renderer ownership around (TreeColumn?)
  3. reconfigure renderer instead of recreating it

@basilevs
Copy link
Contributor

basilevs commented Dec 3, 2024

The main problem is that this bug is caused by bugfix for 513761.

Could you recheck the bug number? I'm sure the bug is introduced in fix for 480261 instead. Here is the commit.

@om-11-2024
Copy link

Yes, it's 480261, not 513761.

zkxvh added a commit to zkxvh/eclipse.platform.swt that referenced this issue Dec 13, 2024
…latform#678

Fix and test for bug eclipse-platform#678

Fixes eclipse-platform#678

Reproducing the crash:
- eclipse-platform#678 (comment)
- eclipse-platform#1611
- eclipse-platform#678 (comment)

The cause of the crash is described here:
eclipse-platform#678 (comment)
In short, the problem was executing `Tree.createRenderers()` inside
`TreeItem.setImage()`.
The sequence of action leading to the crash was:
in a `Tree` with `SWT.VIRTUAL` a `TreeItem` is rendered for the first
time or after `clear()`
  `Tree.cellDataProc()` is executed for the item and one of it's
renderers
    `Tree.checkData(item)` is called for the item
      `SWT.SetData` is invoked and executes `TreeItem.setImage()`
         `Tree.createRenderers()` executes and disposes the current
renderer
    further actions in `Tree.cellDataProc()` that access the
already-disposed renderer (they think it's alive)

How it's fixed:
1. set fixed height+width to pixbuf renderers with
`GTK.gtk_cell_renderer_set_fixed_size`
   This does 2 things:
     - from now on it makes images rendered by the renderer to be
rendered with these height+width
     - from now on fixed row height calculation will return height of at
least fixed height of the pixbuf
2. make `GtkTreeView` recompute it's internally stored fixed row height
with re-setting `fixed_height_mode` of the GtkTreeView
3. make all existing tree rows update their height by invoking
`gtk_tree_view_row_changed()` on each of the rows.

(2) and (3) can also be achieved by creating a copy of the current
`GtkTreeModel` and assigning it to the `GtkTreeView`.
But this also resets current selected rows, focus and scroll position;
that's why I chose re-setting `fixed_height_mode` +
`gtk_tree_view_row_changed()` instead.
zkxvh added a commit to zkxvh/eclipse.platform.swt that referenced this issue Dec 13, 2024
…latform#678

Fix and test for bug eclipse-platform#678

Fixes eclipse-platform#678

Reproducing the crash:
- eclipse-platform#678 (comment)
- eclipse-platform#1611
- eclipse-platform#678 (comment)

The cause of the crash is described here:
eclipse-platform#678 (comment)
In short, the problem was executing `Tree.createRenderers()` inside
`TreeItem.setImage()`.
The sequence of action leading to the crash was:
in a `Tree` with `SWT.VIRTUAL` a `TreeItem` is rendered for the first
time or after `clear()`
  `Tree.cellDataProc()` is executed for the item and one of it's
renderers
    `Tree.checkData(item)` is called for the item
      `SWT.SetData` is invoked and executes `TreeItem.setImage()`
         `Tree.createRenderers()` executes and disposes the current
renderer
    further actions in `Tree.cellDataProc()` that access the
already-disposed renderer (they think it's alive)

How it's fixed:
1. set fixed height+width to pixbuf renderers with
`GTK.gtk_cell_renderer_set_fixed_size`
   This does 2 things:
     - from now on it makes images rendered by the renderer to be
rendered with these height+width
     - from now on fixed row height calculation will return height of at
least fixed height of the pixbuf
2. make `GtkTreeView` recompute it's internally stored fixed row height
with re-setting `fixed_height_mode` of the GtkTreeView
3. make all existing tree rows update their height by invoking
`gtk_tree_view_row_changed()` on each of the rows.

(2) and (3) can also be achieved by creating a copy of the current
`GtkTreeModel` and assigning it to the `GtkTreeView`.
But this also resets current selected rows, focus and scroll position;
that's why I chose re-setting `fixed_height_mode` +
`gtk_tree_view_row_changed()` instead.
@zkxvh
Copy link

zkxvh commented Dec 13, 2024

I looked into the code of swt and gtk here is what I found out:

  • in swt a tree with SWT.VIRTUAL maps to a GtkTreeView with fixed-height-mode=true in gtk
  • before the 1st image for any TreeItem is set there is some fixed row height already.
  • when the 1st image is set for any TreeItem, then
    -pixbufSizeSet,pixbufHeight andpixbufWidth fields are set for Tree
    • if the height of the image is greater than the current fixed row height, then then the current fixed row height should be increased and the tree redrawn.
  • from now on any image set for any item of the tree should have the same size as the 1st image

The fixed row row height for a GtkTreeView is stored in _GtkTreeViewPrivate.fixed_height and is computed in initialize_fixed_height_mode.
The actual row height is stored in _GtkTreeViewPrivate.tree: every row corresponds to a GtkRBNode, and the height is in _GtkRBNode.offset.

So basically what we need to do is: when the 1st image is inserted to the tree, we need to:

  • make GtkTreeView recompute its fixed_height field
  • make existing rows update their _GtkRBNode.offset fields.

I found 2 ways to do that in gtk code:

  1. gtk_tree_view_set_model: i.e. one can create a copy of the current GtkTreeModel and set it to the tree, then everything will be recomputed
  2. re-set fixed-height-mode property of the GtkTreeView (this recomputes _GtkTreeViewPrivate.fixed_height) + execute gtk_tree_view_row_changed on each existing row (this recomputes _GtkRBNode.offset for existing rows)

I chose option 2 in #1662 simply because it doesn't resets current selection, focus and scroll position for the tree.

P.S. It turns out that using Display.asyncExec() is fine in this case (actually I had to use it).
Actually GTK itself actively uses asynchronous tasks, e.g. all the drawing and validation happens asynchronously.
So heavy updates of the tree (like in this case) should be executed asynchronously and not inside Tree.cellDataProc().

zkxvh added a commit to zkxvh/eclipse.platform.swt that referenced this issue Dec 16, 2024
…latform#678

Added code formatting changed to match the formatting of the rest of the
SWT code.
@zkxvh
Copy link

zkxvh commented Dec 18, 2024

I tried to fix the bug by changing TreeItem.setImage(...) in #1662.
This bug was fixed, but I got the same process crash in Eclipse RCP in a code that uses Tree but with slightly different error message:

Bail out! Gtk:ERROR:../gtk/gtktreestore.c:597:gtk_tree_store_get_path: assertion failed: (G_NODE (iter->user_data)->parent != NULL)
**
Gtk:ERROR:../gtk/gtktreestore.c:597:gtk_tree_store_get_path: assertion failed: (G_NODE (iter->user_data)->parent != NULL)

Stacktrace is this:

	org.eclipse.swt.widgets.TreeItem.getParentItem(TreeItem.java:866)
	at org.eclipse.jface.viewers.TreeViewer.getParentItem(TreeViewer.java:228)
	at org.eclipse.jface.viewers.AbstractTreeViewer.getTreePathFromItem(AbstractTreeViewer.java:3046)
	at org.eclipse.jface.viewers.AbstractTreeViewer.internalFindItem(AbstractTreeViewer.java:211)
	at org.eclipse.jface.viewers.AbstractTreeViewer.internalFindItems(AbstractTreeViewer.java:188)
	at org.eclipse.jface.viewers.TreeViewer.replace(TreeViewer.java:435)
	at TestLazyContentProvider.updateElement(TestLazyContentProvider.java:100)
	at org.eclipse.jface.viewers.TreeViewer.virtualLazyUpdateWidget(TreeViewer.java:1001)
	at org.eclipse.jface.viewers.TreeViewer.lambda$1(TreeViewer.java:260)
	at org.eclipse.swt.widgets.EventTable.sendEvent(EventTable.java:89)
	at org.eclipse.swt.widgets.Display.sendEvent(Display.java:5855)
	at org.eclipse.swt.widgets.Widget.sendEvent(Widget.java:1529)
	at org.eclipse.swt.widgets.Widget.sendEvent(Widget.java:1555)
	at org.eclipse.swt.widgets.Widget.sendEvent(Widget.java:1538)
	at org.eclipse.swt.widgets.Tree.checkData(Tree.java:372)
	at org.eclipse.swt.widgets.Tree.cellDataProc(Tree.java:304)
	at org.eclipse.swt.widgets.Display.cellDataProc(Display.java:995)
	at org.eclipse.swt.internal.gtk.GTK.gtk_tree_store_clear(Native Method)
	at org.eclipse.swt.widgets.Tree.removeAll(Tree.java:2997)
	at org.eclipse.jface.viewers.TreeViewer.removeAll(TreeViewer.java:289)
	at org.eclipse.jface.viewers.AbstractTreeViewer.lambda$1(AbstractTreeViewer.java:1593)
	at org.eclipse.jface.viewers.StructuredViewer.preservingSelection(StructuredViewer.java:1392)
	at org.eclipse.jface.viewers.TreeViewer.preservingSelection(TreeViewer.java:367)
	at org.eclipse.jface.viewers.StructuredViewer.preservingSelection(StructuredViewer.java:1353)
	at org.eclipse.jface.viewers.AbstractTreeViewer.inputChanged(AbstractTreeViewer.java:1589)
	at org.eclipse.jface.viewers.ContentViewer.setInput(ContentViewer.java:282)
	at org.eclipse.jface.viewers.StructuredViewer.setInput(StructuredViewer.java:1636)

This is a very similar bug.
Again we get:

Display.cellDataProc(...)
  ↳ Tree.checkData(...)
    ↳ sendEvent(SWT.SetData, ...)
      ↳ a callback for SWT.SetData is invoked, it adds/removes columns/rows/renderers/other tree components
  ↳ the code in Display.cellDataProc(...) continues execution and crashes when it accesses removed/modified tree components

The problem again is that a SWT.SetData callback changes the structure of the tree, while Display.cellDataProc(...) (where the callback is called from) is a cell data function in GTK terms which should only set renderers' properties like text, image, etc. and should not change tree structure.

Since callbacks for SWT.SetData can contain any code, I think that the better fix would be changing Display.cellDataProc(...) to postpone sending SWT.SetData until the method finishes.
That would fix this bug and also all other similar bugs that crash the application because of Tree.

zkxvh added a commit to zkxvh/eclipse.platform.swt that referenced this issue Jan 9, 2025
Fix for `SIGSEGV` in `Tree.cellDataProc(...)` when calling
`TreeItem.setImage(...)`.

Fixes eclipse-platform#678

Reproducing the crash:
- eclipse-platform#678 (comment)
- eclipse-platform#1611
- eclipse-platform#678 (comment)

The cause of the crash is described here:
eclipse-platform#678 (comment)
In short, the crash happens due to read accesses to an already disposed
renderer.
The sequence of action leading to the crash was:
- in a `Tree` with `SWT.VIRTUAL` a `TreeItem` is rendered for the first
time or after `clear()`
  - `Tree.cellDataProc()` is executed for the item and one of the
renderers
    - `Tree.checkData(item)` is called for the item
      - `SWT.SetData` event is created and sent for the item
        - `TreeItem.setImage() is executed by the event handler for
`SWT.SetData``
         - `Tree.createRenderers()` executes and disposes the current
renderer
    - further actions in `Tree.cellDataProc()` that access the
already-disposed renderer (they think it's alive)

How it's fixed: in `Tree.cellDataProc()` wrap `Tree.checkData(item)`
into `Display.asyncExec()`.

Why fixed this way:
1. on one hand, `Tree.cellDataProc()` is a [cell data
function](https://docs.gtk.org/gtk3/treeview-tutorial.html#cell-data-functions)
which is not supposed to change tree structure. Violation of this leads
to C memory errors.
2. On the other hand, `SWT.SetData` event handlers are written by swt
users and therefore can contain any code.

Using `Display.asyncExec()` to postpone `SWT.SetData` event handlers
until `Tree.cellDataProc()` is finished seems like the most simple and
bullet-proof solution to eclipse-platform#678 and all similar bugs.
@zkxvh zkxvh linked a pull request Jan 9, 2025 that will close this issue
zkxvh added a commit to zkxvh/eclipse.platform.swt that referenced this issue Jan 10, 2025
Fix for `SIGSEGV` in `Tree.cellDataProc(...)` when calling
`TreeItem.setImage(...)`.

Fixes eclipse-platform#678

Reproducing the crash:
- eclipse-platform#678 (comment)
- eclipse-platform#1611
- eclipse-platform#678 (comment)

The cause of the crash is described here:
eclipse-platform#678 (comment)
In short, the crash happens due to read accesses to an already disposed
renderer.
The sequence of action leading to the crash was:
- in a `Tree` with `SWT.VIRTUAL` a `TreeItem` is rendered for the first
time or after `clear()`
  - `Tree.cellDataProc()` is executed for the item and one of the
renderers
    - `Tree.checkData(item)` is called for the item
      - `SWT.SetData` event is created and sent for the item
        - `TreeItem.setImage() is executed by the event handler for
`SWT.SetData``
         - `Tree.createRenderers()` executes and disposes the current
renderer
    - further actions in `Tree.cellDataProc()` that access the
already-disposed renderer (they think it's alive)

How it's fixed: in `Tree.cellDataProc()` wrap `Tree.checkData(item)`
into `Display.asyncExec()`.

Why fixed this way:
1. on one hand, `Tree.cellDataProc()` is a [cell data
function](https://docs.gtk.org/gtk3/treeview-tutorial.html#cell-data-functions)
which is not supposed to change tree structure. Violation of this leads
to C memory errors.
2. On the other hand, `SWT.SetData` event handlers are written by swt
users and therefore can contain any code.

Using `Display.asyncExec()` to postpone `SWT.SetData` event handlers
until `Tree.cellDataProc()` is finished seems like the most simple and
bullet-proof solution to eclipse-platform#678 and all similar bugs.
zkxvh added a commit to zkxvh/eclipse.platform.swt that referenced this issue Jan 10, 2025
Fix for `SIGSEGV` in `Tree.cellDataProc(...)` when calling
`TreeItem.setImage(...)`.

Fixes eclipse-platform#678

Reproducing the crash:
- eclipse-platform#678 (comment)
- eclipse-platform#1611
- eclipse-platform#678 (comment)

The cause of the crash is described here:
eclipse-platform#678 (comment)
In short, the crash happens due to read accesses to an already disposed
renderer.
The sequence of action leading to the crash was:
- in a `Tree` with `SWT.VIRTUAL` a `TreeItem` is rendered for the first
time or after `clear()`
  - `Tree.cellDataProc()` is executed for the item and one of the
renderers
    - `Tree.checkData(item)` is called for the item
      - `SWT.SetData` event is created and sent for the item
        - `TreeItem.setImage() is executed by the event handler for
`SWT.SetData``
         - `Tree.createRenderers()` executes and disposes the current
renderer
    - further actions in `Tree.cellDataProc()` that access the
already-disposed renderer (they think it's alive)

How it's fixed: in `Tree.cellDataProc()` wrap `Tree.checkData(item)`
into `Display.asyncExec()`.

Why fixed this way:
1. on one hand, `Tree.cellDataProc()` is a [cell data
function](https://docs.gtk.org/gtk3/treeview-tutorial.html#cell-data-functions)
which is not supposed to change tree structure. Violation of this leads
to C memory errors.
2. On the other hand, `SWT.SetData` event handlers are written by swt
users and therefore can contain any code.

Using `Display.asyncExec()` to postpone `SWT.SetData` event handlers
until `Tree.cellDataProc()` is finished seems like the most simple and
bullet-proof solution to eclipse-platform#678 and all similar bugs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
9 participants