cip/linux-4.19.y-cip baseline-nfs: 12 runs, 1 regressions (v4.19.217-cip62) #kernelci


kernelci.org bot <bot@...>
 

cip/linux-4.19.y-cip baseline-nfs: 12 runs, 1 regressions (v4.19.217-cip62)

Regressions Summary
-------------------

platform | arch | lab | compiler | defconfig | regressions
-----------------+-------+---------------+----------+-----------+------------
rk3399-gru-kevin | arm64 | lab-collabora | gcc-10 | defconfig | 1

Details: https://kernelci.org/test/job/cip/branch/linux-4.19.y-cip/kernel/v4.19.217-cip62/plan/baseline-nfs/

Test: baseline-nfs
Tree: cip
Branch: linux-4.19.y-cip
Describe: v4.19.217-cip62
URL: https://git.kernel.org/pub/scm/linux/kernel/git/cip/linux-cip.git
SHA: dc62e26e3be875a7324b85b8274c13a335e610dd


Test Regressions
----------------


platform | arch | lab | compiler | defconfig | regressions
-----------------+-------+---------------+----------+-----------+------------
rk3399-gru-kevin | arm64 | lab-collabora | gcc-10 | defconfig | 1

Details: https://kernelci.org/test/plan/id/61a58c2989f6953bd118f6e4

Results: 0 PASS, 1 FAIL, 0 SKIP
Full config: defconfig
Compiler: gcc-10 (aarch64-linux-gnu-gcc (Debian 10.2.1-6) 10.2.1 20210110)
Plain log: https://storage.kernelci.org//cip/linux-4.19.y-cip/v4.19.217-cip62/arm64/defconfig/gcc-10/lab-collabora/baseline-nfs-rk3399-gru-kevin.txt
HTML log: https://storage.kernelci.org//cip/linux-4.19.y-cip/v4.19.217-cip62/arm64/defconfig/gcc-10/lab-collabora/baseline-nfs-rk3399-gru-kevin.html
Rootfs: http://storage.kernelci.org/images/rootfs/debian/bullseye/20211126.0/arm64/initrd.cpio.gz


* baseline-nfs.login: https://kernelci.org/test/case/id/61a58c2989f6953bd118f6e5
new failure (last pass: v4.19.216-cip61)


Pavel Machek
 

Hi!

So... I tried to understand this report, and still could not.

The first problem is actually in the From: line. With a bot there, it
is not clear who is responsible for this, or whether anyone reads replies
to the bot address.

I feel posts to mailing lists should be signed by the human responsible
for them.

Then we have:

cip/linux-4.19.y-cip baseline-nfs: 12 runs, 1 regressions (v4.19.217-cip62)
Ok, so we may have a regression. That means it worked before and it
does not work now. I'd expect two versions "worked in v4.19.123-cip12,
now broken in v4.19.217-cip62", but we only have one.

Regressions Summary
-------------------

platform | arch | lab | compiler | defconfig | regressions
-----------------+-------+---------------+----------+-----------+------------
rk3399-gru-kevin | arm64 | lab-collabora | gcc-10 | defconfig | 1
URL: https://git.kernel.org/pub/scm/linux/kernel/git/cip/linux-cip.git
SHA: dc62e26e3be875a7324b85b8274c13a335e610dd
Still no note when it worked last.

HTML log: https://storage.kernelci.org//cip/linux-4.19.y-cip/v4.19.217-cip62/arm64/defconfig/gcc-10/lab-collabora/baseline-nfs-rk3399-gru-kevin.html
Ok, so we have a boot log from a machine that's quite unhappy. Some of
the problems are in the kernel, but we see missing firmware, too. In the
end, it looks like it has no usable network card, so it cannot do an NFS
boot... and panics.

It is hard to tell a config problem from a kernel bug without knowing more
about the machine configuration. Seeing previous successful runs would help most...
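
The triage described above (missing firmware, then no network device for the NFS boot) can be sketched as a small log scan. This is only an illustration: the function name `summarize_boot_log` is hypothetical, the sample lines are taken from the log excerpt in this message, and it assumes each kernel message sits on a single line rather than being wrapped by the mail client.

```python
import re

# Kernel message for a firmware file that could not be loaded directly.
# Error -2 is -ENOENT, i.e. the file was not found in the rootfs.
FIRMWARE_RE = re.compile(r"Direct firmware load for (\S+) failed with error (-?\d+)")

# Printed by the initrd's ipconfig when no network interface exists.
NO_NETDEV = "ipconfig: no devices to configure"


def summarize_boot_log(lines):
    """Return (missing firmware files, True if ipconfig found no device)."""
    missing = []
    no_netdev = False
    for line in lines:
        match = FIRMWARE_RE.search(line)
        if match and match.group(1) not in missing:
            missing.append(match.group(1))
        if NO_NETDEV in line:
            no_netdev = True
    return missing, no_netdev


# Sample lines copied from the failing boot log (unwrapped).
log = [
    "02:27:48.362782 <4>[ 15.318381] atmel_mxt_ts 3-004b: Direct firmware load for maxtouch.cfg failed with error -2",
    "02:27:48.535527 <4>[ 15.377455] atmel_mxt_ts 5-004a: Direct firmware load for maxtouch.cfg failed with error -2",
    "02:27:48.827088 ipconfig: no devices to configure",
    "02:27:48.926132 <4>[ 15.866511] platform regulatory.0: Direct firmware load for regulatory.db failed with error -2",
]

missing, no_netdev = summarize_boot_log(log)
print("missing firmware:", missing)
print("no network device:", no_netdev)
```

A scan like this only separates the noise (firmware files absent from the rootfs) from the likely cause of the failed login (no network device); it does not say why the device is missing.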

Is there a human here who believes this is a problem in the -cip kernel that
is worth solving, and is willing to answer questions and test patches?

Best regards,

Pavel

02:27:34.443815 <4>[ 1.401574] cacheinfo: Unable to detect cache hierarchy for CPU 0
02:27:34.452441 <6>[ 1.413496] loop: module loaded
02:27:34.464113 <4>[ 1.421343] rockchip-spi ff1d0000.spi: Failed to request TX DMA channel
02:27:34.471531 <4>[ 1.429063] rockchip-spi ff1d0000.spi: Failed to request RX DMA channel
02:27:34.487460 <6>[ 1.446867] m25p80 spi0.0: gd25lq64c (8192 Kbytes)
02:27:34.501792 <4>[ 1.459399] rockchip-spi ff1e0000.spi: Failed to request TX DMA channel
...
02:27:48.313465 <6>[ 15.264576] atmel_mxt_ts 3-004b: Family: 164 Variant: 14 Firmware V2.3.AA Objects: 40
02:27:48.362782 <4>[ 15.318381] atmel_mxt_ts 3-004b: Direct firmware load for maxtouch.cfg failed with error -2
02:27:48.535527 <4>[ 15.377455] atmel_mxt_ts 5-004a: Direct firmware load for maxtouch.cfg failed with error -2
02:27:48.827088 ipconfig: no devices to configure
02:27:48.831012 ipconfig: no devices to configure
02:27:48.926132 <4>[ 15.866511] platform regulatory.0: Direct firmware load for regulatory.db failed with error -2
02:27:48.931359 <6>[ 15.879008] pci 0000:00:00.0: PCI bridge to [bus 01]
02:27:48.938653 <6>[ 15.879016] pci 0000:00:00.0: bridge window [mem 0xfa000000-0xfa1fffff]

--
DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany


Chris Paterson
 

+kernelci ML as they will appreciate the feedback.

From: Pavel Machek <pavel@...>
Sent: 30 November 2021 08:33

Then we have:

cip/linux-4.19.y-cip baseline-nfs: 12 runs, 1 regressions (v4.19.217-cip62)
Ok, so we may have a regression. That means it worked before and it
does not work now. I'd expect two versions "worked in v4.19.123-cip12,
now broken in v4.19.217-cip62", but we only have one.

Regressions Summary
-------------------

platform | arch | lab | compiler | defconfig | regressions
-----------------+-------+---------------+----------+-----------+------------
rk3399-gru-kevin | arm64 | lab-collabora | gcc-10 | defconfig | 1
Did you not have a "details" link here? In my copy of the email (attached) it links to
https://linux.kernelci.org/test/job/cip/branch/linux-4.19.y-cip/kernel/v4.19.217-cip62/plan/baseline-nfs/

If you click on the platform in question it says: "New regression, last pass: v4.19.216-cip61", which also links to the relevant build job.
I think it also says this in the email below the failed test case? (baseline-nfs.login)
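
For readers who want to pull that information out of the report text itself, here is a minimal sketch. It assumes the per-test-case block keeps the two-line shape seen in this report (a `* <test case>: <URL>` line followed by `new failure (last pass: <version>)`); the function name `find_regressions` is hypothetical and this is not part of any KernelCI tooling.

```python
import re

# Matches the two-line block at the end of the report, e.g.
#   * baseline-nfs.login: https://kernelci.org/test/case/id/...
#   new failure (last pass: v4.19.216-cip61)
CASE_RE = re.compile(
    r"^\*\s+(?P<case>\S+):\s+(?P<url>\S+)\s*\n"
    r"\s*new failure \(last pass: (?P<last_pass>[^)]+)\)",
    re.MULTILINE,
)


def find_regressions(report_text):
    """Return (test case, last passing version, details URL) tuples."""
    return [
        (m.group("case"), m.group("last_pass"), m.group("url"))
        for m in CASE_RE.finditer(report_text)
    ]


# Excerpt copied from the report at the top of this thread.
report = """\
Describe: v4.19.217-cip62

* baseline-nfs.login: https://kernelci.org/test/case/id/61a58c2989f6953bd118f6e5
new failure (last pass: v4.19.216-cip61)
"""

for case, last_pass, url in find_regressions(report):
    print(f"{case}: last pass {last_pass} ({url})")
```

Together with the `Describe:` line, this gives the "worked in X, broken in Y" pair: last pass v4.19.216-cip61, failing in v4.19.217-cip62.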


URL: https://git.kernel.org/pub/scm/linux/kernel/git/cip/linux-cip.git
SHA: dc62e26e3be875a7324b85b8274c13a335e610dd
Still no note when it worked last.

HTML log: https://storage.kernelci.org//cip/linux-4.19.y-cip/v4.19.217-cip62/arm64/defconfig/gcc-10/lab-collabora/baseline-nfs-rk3399-gru-kevin.html
Again, there should also be a link to the specific failed test case, in this case:
https://linux.kernelci.org/test/plan/id/61a58c2989f6953bd118f6e4/

This includes a link to the defconfig used, although perhaps you'd like to know more about the platform.

Kind regards, Chris



Pavel Machek
 

Hi!

Then we have:

cip/linux-4.19.y-cip baseline-nfs: 12 runs, 1 regressions (v4.19.217-cip62)
Ok, so we may have a regression. That means it worked before and it
does not work now. I'd expect two versions "worked in v4.19.123-cip12,
now broken in v4.19.217-cip62", but we only have one.

Regressions Summary
-------------------

platform | arch | lab | compiler | defconfig | regressions
-----------------+-------+---------------+----------+-----------+------------
rk3399-gru-kevin | arm64 | lab-collabora | gcc-10 | defconfig | 1
Did you not have a "details" link here? In my copy of the email (attached) it links to
https://linux.kernelci.org/test/job/cip/branch/linux-4.19.y-cip/kernel/v4.19.217-cip62/plan/baseline-nfs/

If you click on the platform in question it says: "New regression, last pass: v4.19.216-cip61", which also links to the relevant build job.
I think it also says this in the email below the failed test case? (baseline-nfs.login)
I did not find the "New regression, last pass" summary, but clicking
on the test case path did give me both the failing and the passing dmesg.

Working test says:

07:05:59.319716 <4>[ 16.957513] cdn-dp fec00000.dp: Direct firmware load for rockchip/dptx.bin failed with error -2
07:06:00.206271 <6>[ 17.844209] asix 7-1:1.0 enx48f8b3454d72: link up, 100Mbps, full-duplex, lpa 0xC1E1
07:06:00.551556 IP-Config: no response after 2 secs - giving up
07:06:00.609271 IP-Config: enx48f8b3454d72 hardware address 48:f8:b3:45:4d:72 mtu 1500 DHCP

And after scrolling around, it looks like enx48f8b3454d72 is a USB
Ethernet card.
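
Rather than scrolling through both logs, one rough way to spot what the passing boot has and the failing boot lacks is to compare which drivers and tools print anything at all. The sketch below is an assumption-laden illustration: the helper `sources` is hypothetical, the sample lines are short excerpts from the two logs quoted in this thread, and it assumes unwrapped lines with the timestamp layout shown here.

```python
import re

# Strip the wall-clock stamp and, if present, the kernel "<level>[ time]" prefix.
PREFIX_RE = re.compile(r"^\d{2}:\d{2}:\d{2}\.\d+\s+(?:<\d>\[\s*\d+\.\d+\]\s+)?")


def sources(lines):
    """Return the set of message sources (first word of each message)."""
    found = set()
    for line in lines:
        body = PREFIX_RE.sub("", line).strip()
        if body:
            found.add(body.split()[0].rstrip(":"))
    return found


# Excerpt from the passing run (last pass: v4.19.216-cip61).
passing = [
    "07:05:59.319716 <4>[ 16.957513] cdn-dp fec00000.dp: Direct firmware load for rockchip/dptx.bin failed with error -2",
    "07:06:00.206271 <6>[ 17.844209] asix 7-1:1.0 enx48f8b3454d72: link up, 100Mbps, full-duplex, lpa 0xC1E1",
    "07:06:00.551556 IP-Config: no response after 2 secs - giving up",
]

# Excerpt from the failing run (v4.19.217-cip62).
failing = [
    "02:27:48.362782 <4>[ 15.318381] atmel_mxt_ts 3-004b: Direct firmware load for maxtouch.cfg failed with error -2",
    "02:27:48.827088 ipconfig: no devices to configure",
]

only_in_pass = sources(passing) - sources(failing)
print(sorted(only_in_pass))  # ['IP-Config', 'asix', 'cdn-dp']
```

On excerpts this short the difference is mostly noise, but run over the full logs the same comparison would show whether the `asix` USB Ethernet driver ever reports its interface in the failing boot.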

This includes a link to the defconfig used, although perhaps you'd like to know more about the platform.
Yes, knowing more about the platform would be nice (one question would be
"Is the ethernet removable?"). But in a case like this I'd need someone
to do the tests for me (or to have enough access to do the tests
myself). The first question would be "does v4.19.217 work with the same
config?" The easiest way to debug this is usually bisection....

Is that feasible?
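
For reference, the bisection idea can be sketched as a binary search over the commits between the last passing and the first failing version. Everything below is illustrative: the commit names `c1`..`c4` are made up, and `is_good` stands in for "build this commit with the same defconfig, boot it on the board, and check that baseline-nfs.login passes". In practice `git bisect` drives the same search.

```python
def bisect_first_bad(commits, is_good):
    """Return the first commit for which is_good() is False.

    `commits` is ordered oldest to newest; commits[0] is known good and
    commits[-1] is known bad. Assumes a single good-to-bad transition.
    """
    lo, hi = 0, len(commits) - 1  # lo: known good, hi: known bad
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_good(commits[mid]):
            lo = mid
        else:
            hi = mid
    return commits[hi]


# Hypothetical history between the two tags named in the report.
commits = ["v4.19.216-cip61", "c1", "c2", "c3", "c4", "v4.19.217-cip62"]

# Stand-in for a real boot test: pretend "c3" introduced the failure.
first_bad = bisect_first_bad(commits, lambda c: commits.index(c) < 3)
print(first_bad)  # c3
```

Each step costs one build and one boot on the board, so the number of test runs grows with the logarithm of the number of commits between the two tags.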

Best regards,
Pavel
--
DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany


Pavel Machek
 

Hi!

On Tue 2021-11-30 10:22:14, Chris Paterson wrote:
+kernelci ML as they will appreciate the feedback.
Thank you, but it looks like they are rejecting the email :-(.

<kernelci@groups.io>: host lb01.groups.io[45.79.81.153] said: 500 This
message has been flagged as spam. (in reply to end of DATA command)


Best regards,
Pavel
--
DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany