[Gluster-users] QEMU gfapi segfault

Josh Boon gluster at joshboon.com
Fri Jan 2 19:07:48 UTC 2015


Another machine gave up the ghost and I got an apport crash with an incomplete core dump. You can download it from here  https://onedrive.live.com/redir?resid=60BD302DEC1727F0!21858&authkey=!ACzDg-J6cOhLFK0&ithint=file%2ccrash and open it in your favorite text editor for some idea of what my system was doing at the time. From my understanding of how core dumps work you'd want the full machine memory but all of my machines that have crashed are in the 8GB to 24GB range so I'm not sure how to handle one of those core dumps should I get one. Thoughts? 


----- Original Message -----
From: "Josh Boon" <gluster at joshboon.com>
To: "Vijay Bellur" <vbellur at redhat.com>
Cc: "Gluster-users at gluster.org List" <gluster-users at gluster.org>
Sent: Wednesday, December 31, 2014 7:24:21 PM
Subject: Re: [Gluster-users] QEMU gfapi segfault

Not this time around. I've increased the limits as these machines are rather big for ram requirements. 
----- Original Message -----
From: "Vijay Bellur" <vbellur at redhat.com>
To: "Josh Boon" <gluster at joshboon.com>, "Gluster-users at gluster.org List" <gluster-users at gluster.org>
Sent: Wednesday, December 31, 2014 4:06:09 PM
Subject: Re: [Gluster-users] QEMU gfapi segfault

On 12/31/2014 04:11 AM, Josh Boon wrote:
> Hey folks,
>
> I'm working on tracking down rogue QEMU segfaults in my infrastructure
> that look to be dying due to gluster. The tips that I get is that the
> process is in disk sleep when it dies and the process is backed only by
> gluster and the segfault lends to io system issues. Unfortunately I
> haven't figured out how to get a full crash dump so I can run it through
> apport-retrace to get exactly what went wrong. The other interesting
> thing is this happens only when gluster is under heavy load. Any tips
> about debugging further or getting this fixed up would be appreciated.
>
> Segfault:
>
> Dec 30 20:42:56 HFMHVR3 kernel: [5976247.820875] qemu-system-x86[27730]:
> segfault at 128 ip 00007f891f0cc82c sp 00007f89376846a0 error 4 in
> qemu-system-x86_64 (deleted)[7f891ed42000+4af000]
>

Do you see a qemu core dump file? If yes, can you please post the backtrace?

-Vijay



_______________________________________________
Gluster-users mailing list
Gluster-users at gluster.org
http://www.gluster.org/mailman/listinfo/gluster-users


More information about the Gluster-users mailing list