Announcing a new -ck release, 4.17-ck1 with the latest version of the Multiple Queue Skiplist Scheduler, version 0.172. These are patches designed to improve system responsiveness and interactivity with specific emphasis on the desktop, but configurable for any workload.
This is just a resync with 4.16 MuQSS and -ck patches.
Thanks a lot!ReplyDelete
Thank you very much!ReplyDelete
I also love the tunables.
echo 1 > /proc/sys/kernel/hrtimeout_min_us
echo 15 > /proc/sys/kernel/hrtimer_granularity_us
echo 0 > /proc/sys/kernel/yield_type
I get an unbootable system when attempting to boot with 4.17-ck1 patched. Coming out of grub2, I see only, "Loading initial ramdisk ..." and a corresponding freeze.ReplyDelete
You could compile in drivers for harddisk-controller, filesystem(s), usb, video, etc. and go without initial ramdisk.Delete
I heard of people running into problems with kernels > 4.16.x while using xz compression. Their problem went away after switching to old gzip.Delete
http://blog.fefe.de/?ts=a5bcaff9 (site is in German)
I got problem with >4.16 (i think because of new Intel microcode) with >HZ_300.Delete
Ok. What problem exactly?Delete
I suspect there may be a problem in the interaction between MuQSS and recent developments in BFQ (either SQ or MQ).ReplyDelete
- CFQ + CFS = Fine
- BFQ + CFS = Fine
- BFQ + MuQSS = Kernel panics, randomly.
Haven't tried CFQ + MuQSS yet though. Building the kernel on this box is going to end up being dreadful and Liquorix have resorted to dumping all non-BFQ IO schedulers.
I can't be entirely certain but I think it may somehow be related to cgroups.
- Kyber + MuQSS also seems fine so it's probably not the blk-mq infrastructure itself.
I have a similar situation. Only I no have panic - the computer just hangs with the large input / output activity.
Actually, it probably is kernel panics. For the longest time I wasn't sure what it was as the window manager and desktop environment would just completely die.Delete
But then, one time, I got lucky and it happened while I was in a virtual TTY (CTRL+ALT+Fx). So, I could finally see what was happening. Kernel panic in BFQ.
Why wouldn't you all post the call traces of those panics?..Delete
Because 99% of them happen with the Window Manager and DE having focus/control. So, nothing to write down. Literally, all control I have left after these crashes... is physically pulling the plug.Delete
Literally, the PC is dead. Cutting power is all that works.
And with it being a kernel panic, no log is actually written (for obvious reasons).
So, it would have to be by hand, by definition. And the one time I did see the panic, well, I just didn't think about writing down every little bit of a console screen full of information.
Shoot me, never seen the kernel panic before. Part of the reason why I like Linux so much. It usually never does this. Unlike Windows.
Valid point. If I had access to a second PC or anything else that can be properly configured as a receiver. Which I don't.Delete
Closest candidate would be the modem/router but it's a proprietary ISP box so, not enough low level access.
Looks like no dice there.
Besides, the panics are very random in nature. It can go for an entire day without it crashing but it can also crash back to back. So, no longer using the combination of MuQSS and BFQ.
I wish I could help in pinpointing the issue more precisely but for now, I am forced to simply conclude that in my case it is not production ready and I do kind of need this machine to actually work.
Again, it's a perfectly valid point you're making. Just not one that is available to me at this time. Sorry.
You can use any remote box with public IP and ability to receive UDP packets on some port. Or you can ask someone to accept those packets for you.Delete
Looking into options there. In the meanwhile, using BFQ + CFS for well over a week now without seeing a single issue.Delete
Definitely related to the specific combination of BFQ + MuQSS. Again, looking into available netconsole options.
Kind of new to kernel panics. Never experienced this kind of behaviour from the kernel before.
Still no real luck finding a good, reliable method of getting that panic information out there.Delete
But, on an interesting note -- Decided to give MuQSS + BFQ-SQ another go and, so far it's been stable.
The sole difference -- the MuQSS knobs. Previously I had kernel.interactive and kernel.yield_type set to 0.
Now, with kernel.interactive 1, kernel.yield_type 2 and kernel.rr_interval 1 it is perfectly stable.
Not too sure what to make of that but, with BFQ being the complex scheduler it is, there might be something there.
OK, nevermind... it crashed again. And, coming to the realization that it can't be MuQSS. It has never given me any trouble before.Delete
It has to be BFQ itself. It simply is not production ready.
This is now across 2 different HDDs, so definitely not hardware. Has to be software and has to be BFQ. OK... maybe I'll need to start thinking about spinning my own kernel, to have MuQSS + CFQ.
In my experience once you get muqss to boot it is stable.Delete
Although sometimes it will hang/crash at boot in a virtualbox vm.
MuQSS isn't the issue for me. MuQSS is fine and has been fine for me for quite some time now.Delete
I was assuming originally that it was somehow some form of interaction between MuQSS and BFQ but I'm getting more and more signals it simply is just BFQ itself.
Currently running MuQSS + noop (because Liquorix removed all non-BFQ schedulers from the build, something I am not happy with) and it's fine.
Did take a look at spinning my own kernel, just to get MuQSS + CFQ going but, nah, with my hardware a kernel compile could take anywhere from 90 minutes to 150 minutes. And if I then happen to make a mistake somewhere I can just restart that entire process.
No, I think I'll have to accept noop for some time. Until the BFQ people figure out how to actually get their IO scheduler running properly.
I literally cannot believe it was mainlined in this state. It boggles the mind. Has to be a political decision, not a technological one. OK, that was off-topic, sorry.
using mq-deadline, no problems...ReplyDelete
Don't know if this is the right place to post this, but is there an issue with CUDA working on this kernel right now? I have a few people experiencing the same people on Arch Linux right now.ReplyDelete
Here's the forum post: https://bbs.archlinux.org/viewtopic.php?id=239174
Enabling NUMA in the kernel config seems to have fixed the issue^.Delete
Will this work on 4.18?ReplyDelete
I tried, no.Delete
Con, are you okay? You got lost somewhere.ReplyDelete
Latest 4.17 kernels fails to build or patchReplyDelete
For 4.17.12 I deleted cpufreq_schedutil.c part from the MuQSS patch (and excluded Schedutil from config)
For 4.17.15 I added
Hi Con, thanks and please come back.ReplyDelete
So many of you might need the help of a hacker to do a job.ReplyDelete
But most of you haven't be able to meet one. Well at The Hack geeks, we provide you with the best hacking services there is.
We have hackers in stand by ready to do a job immediately at your pace.
We look forward to hearing from you.
kernel.yield_type = 2ReplyDelete
kernel.rr_interval = 1
kernel.interactive = 1
elevator = noop
Silky smooth. Even on a rotational disk. Silky, silky smooth. MuQSS loves noop or noop loves MuQSS. Not sure which but it's amazing.
Forgot to mention: rqshare = smpDelete