Hi guys! I have KDE Neon 6.2.3 running on my Surface Pro 7+, and up until now it's been running rather smoothly, for years. But today I seem to be unable to fully boot anymore on normal mode.
Linux will get stuck on the KDE spinning gear, with the gear frozen. And it becomes unresponsive, to open alternative consoles with control+alt+Fx, or even control+alt+del. The only thing that works is a force shutdown by holding the power button.
I tried with a previous kernel, with the same results (current is linux-image-6.10.10-surface-1, I have a generic 6.8.0-49, but it also freezes). I can boot in safe mode. It opens a different login screen, and after a failed attempt of opening lxde, it fails back to my usual KDE session, in safe graphics mode.
What can I test? How can I see what failed at boot?
Thanks. I pulled the journalctl. This is...A bit extensive. Just the entries from today are about 4MB in text. I'm having a very hard time finding the moment I force it off and then start again. Any good troubleshooting guide on how to find crashes?
Just an update. Today I made a rescuezilla full backup of the whole drive, and then wiped and installed KDE Neon brand new from Dec 1st release from their page. Guess what...that one ALSO fails to boot.
OK so just to confirm: you get a boot freeze even on a fresh install before you restore a backup?
Are you able to boot a liveusb?
I would try another distro just to rule out a software problem but it sounds like hardware failure. I would test your ram, graphics card, and storage medium. Do you have an SSD or an HDD?
I actually clicked on the replace partition option on the installer, instead of a full disk wipe. The profile did look brand new once I managed to login in safe mode, but I didn't stay too long to check, as I just concluded hell, if it only boots in safe mode I might as well continue troubleshooting on the original install.
Anyway...After going back to the original image (as I had it backed up anyway), I paid a bit more attention at that weird popup on safe mode as I logged in, mentioning it failed to launch startlxde. So I ran
sudo dpkg --reconfigure sddm
And on the menu, for some reason sddm wasn't the first choice. After choosing it again, this time it went straight to the boot screen. Some other day I'll check again whichever random login menu is installed, and remove it, as it clearly causes issues.
About that - I´d like to run a backup that takes all apps and their configuration files (more than my own docs and files, which I already have properly backed in Seafile). What would you recommend just for the apps, their configs (and maybe the repos) etc?
I think you're gonna find that a hard task. There isn't one universal directory where apps saves their configs and bin files. You'll find some in ~/.config
Dec 03 16:27:12 SurfaceNeon wpa_supplicant[1436]: wlp0s20f3: CTRL-EVENT-REGDOM-CHANGE init=DRIVER type=COUNTRY alpha2=HK
Dec 03 16:27:12 SurfaceNeon NetworkManager[1428]: <info> [1733214432.6906] manager: startup complete
Dec 03 16:27:12 SurfaceNeon systemd[1]: Finished NetworkManager-wait-online.service - Network Manager Wait Online.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Reached target network-online.target - Network is Online.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Started update-notifier-download.timer - Download data for packages that failed at
package install time.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Started update-notifier-motd.timer - Check to see whether there is a new version of Ubuntu available.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Reached target timers.target - Timer Units.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Started cups-browsed.service - Make remote CUPS printers available locally.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Started deluged.service - Deluge Bittorrent Client Daemon.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Reached target multi-user.target - Multi-User System.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Reached target graphical.target - Graphical Interface.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Starting systemd-update-utmp-runlevel.service - Record Runlevel Change in UTMP...
Dec 03 16:27:12 SurfaceNeon systemd[1]: systemd-update-utmp-runlevel.service: Deactivated successfully.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Finished systemd-update-utmp-runlevel.service - Record Runlevel Change in UTMP.
Dec 03 16:27:12 SurfaceNeon systemd[1]: Startup finished in 4.484s (firmware) + 6.837s (loader) + 3.208s (kernel) + 11.614s (userspace) = 26.145s.
Dec 03 16:27:12 SurfaceNeon systemd[1]: dmesg.service: Deactivated successfully.
Dec 03 16:27:14 SurfaceNeon ModemManager[1454]: <msg> [base-manager] couldn't check support for device '/sys/devices/pci0000:00/0000:00:14.3': not supported by any plugin
Dec 03 16:27:20 SurfaceNeon systemd[1]: NetworkManager-dispatcher.service: Deactivated successfully.
Dec 03 16:27:20 SurfaceNeon systemd-logind[1351]: Power key pressed short.
Dec 03 16:27:20 SurfaceNeon systemd-logind[1351]: Powering off...
Dec 03 16:27:20 SurfaceNeon systemd-logind[1351]: System is powering down.
Yeah...It looks to me like it just says it went alright, right until I hit the poweroff button, because it's actually completely crashed.
Just an update. Today I made a rescuezilla full backup of the whole drive, and then wiped and installed KDE Neon brand new from Dec 1st release from their page. Guess what...that one ALSO fails to boot.
If a clean install also fails then I would start considering hardware fault.
You could always check if you can boot Fedora Workstation with GNOME just in case but I think you should start looking for a replacement PC.
I actually clicked on the replace partition option on the installer, instead of a full disk wipe. The profile did look brand new once I managed to login in safe mode, but I didn't stay too long to check, as I just concluded hell, if it only boots in safe mode I might as well continue troubleshooting on the original install.
Anyway...After going back to the original image (as I had it backed up anyway), I paid a bit more attention at that weird popup on safe mode as I logged in, mentioning it failed to launch startlxde. So I ran
sudo dpkg --reconfigure sddm
And on the menu, for some reason sddm wasn't the first choice. After choosing it again, this time it went straight to the boot screen. Some other day I'll check again whichever random login menu is installed, and remove it, as it clearly causes issues.
It gives you a webUI that you can use to check out logs and services (among other things) and makes it a lot easier to troubleshoot computer troubles where the machine starts but your GUI doesn't.