Re: [PATCH printk v5 06/40] printk: fix setting first seq for consoles
From: Petr Mladek
Date: Fri Nov 18 2022 - 05:24:00 EST
On Wed 2022-11-16 17:27:18, John Ogness wrote:
> It used to be that all consoles were synchronized with respect to
> which message they were printing. After commit a699449bb13b ("printk:
> refactor and rework printing logic"), all consoles have their own
> @seq for tracking which message they are on. That commit also changed
> how the initial sequence number was chosen. Instead of choosing the
> next non-printed message, it chose the sequence number of the next
> message that will be added to the ringbuffer.
>
> That change created a possibility that a non-boot console taking over
> for a boot console might skip messages if the boot console was behind
> and did not have a chance to catch up before being unregistered.
>
> Since it is not known which boot console is the same device, flush
> all consoles and, if necessary, start with the message of the enabled
> boot console that is the furthest behind. If no boot consoles are
> enabled, begin with the next message that will be added to the
> ringbuffer.
>
> Also, since boot consoles are meant to be used at boot time, handle
> them the same as CON_PRINTBUFFER to ensure that no initial messages
> are skipped.
>
> Signed-off-by: John Ogness <john.ogness@xxxxxxxxxxxxx>
Reviewed-by: Petr Mladek <pmladek@xxxxxxxx>
See one possible improvement below.
> --- a/kernel/printk/printk.c
> +++ b/kernel/printk/printk.c
> @@ -3131,16 +3131,56 @@ static void try_enable_default_console(struct console *newcon)
> (con->flags & CON_BOOT) ? "boot" : "", \
> con->name, con->index, ##__VA_ARGS__)
>
> -static void console_init_seq(struct console *newcon)
> +static void console_init_seq(struct console *newcon, bool bootcon_registered)
> {
> - if (newcon->flags & CON_PRINTBUFFER) {
> + struct console *con;
> + bool handover;
> +
> + if (newcon->flags & (CON_PRINTBUFFER | CON_BOOT)) {
> /* Get a consistent copy of @syslog_seq. */
> mutex_lock(&syslog_lock);
> newcon->seq = syslog_seq;
> mutex_unlock(&syslog_lock);
> } else {
> - /* Begin with next message. */
> + /* Begin with next message added to ringbuffer. */
> newcon->seq = prb_next_seq(prb);
> +
> + /*
> + * If any enabled boot consoles are due to be unregistered
> + * shortly, some may not be caught up and may be the same
> + * device as @newcon. Since it is not known which boot console
> + * is the same device, flush all consoles and, if necessary,
> + * start with the message of the enabled boot console that is
> + * the furthest behind.
> + */
> + if (bootcon_registered && !keep_bootcon) {
> + /*
> + * Flush all consoles and set the console to start at
> + * the next unprinted sequence number.
> + */
> + if (!console_flush_all(true, &newcon->seq, &handover)) {
> + /*
> + * Flushing failed. Just choose the lowest
> + * sequence of the enabled boot consoles.
> + */
> +
> + /*
> + * If there was a handover, this context no
> + * longer holds the console_lock.
> + */
> + if (handover)
> + console_lock();
Another improvement might be to disable handover in this case.
It would be safe because we are in a sleepable context.
It would increase the chance that console_fluhs_all() succeeded.
On the other hand, it might cause that this caller gets stuck
here because of flood of messages printed by another caller.
We could do this later when there are problems with this approach.
The problem with the handover has been there even before.
I do not want to delay this patchset by discussion this non-critical
problem to the death ;-)
Best Regards,
Petr