Difference between revisions of "Poky migration from rocko to warrior"
From ElphelWiki
Revision as of 10:06, 1 August 2019
Contents
- 1 Elphel's kernel tree
- 2 [SOLVED] Note 1: Bring back fpga char device
- 3 [SOLVED] Note 2: Build php 5.6.40
- 4 [SOLVED] Note 3: Entropy device hwrng
- 5 - Note 4: PHP causing 'unsupported FP instruction in kernel mode'
- 6 [SOLVED] Note 5: Bring up NAND OTP support
- 7 [SOLVED] Note 6: udev - unknown group 'kvm'
- 8 [SOLVED] Note 7: Add back fixup for Atheros to updated ethernet driver
- 9 [SOLVED] Note 8: u-boot update
- 10 [SOLVED] Note 9: test camogm
- 11 [SOLVED] Note 10: test streamer
- 12 [SOLVED] Note 11: test AHCI driver
- 13 - Note 12: test raw recording
Elphel's kernel tree
.
├── arch
│   └── arm
│       └── boot
│           └── dts/    # device trees for 393 cameras, considered tested
├── drivers
│   ├── ata
│   │   ├── ahci_elphel.c    # tested reading and writing from/to SSD
│   │   └── libata-eh.c
│   ├── char
│   │   └── xilinx_devcfg.c    # tested bitstream loading - brought back the old character device driver; this is simpler than the new FPGA manager, which cannot load plain .bit files
│   ├── clk
│   │   └── clk-si5338.c    # chip found, no errors
│   ├── elphel
│   │   ├── circbuf.c    # tested via recording
│   │   ├── clock10359.c
│   │   ├── command_sequencer.c    # ok
│   │   ├── cxi2c.c
│   │   ├── detect_sensors.c
│   │   ├── elphel393-init.c    # ok
│   │   ├── elphel393-mem.c    # ok
│   │   ├── elphel393-pwr.c    # ok
│   │   ├── exif393.c
│   │   ├── fpgajtag353.c
│   │   ├── framepars.c    # ok
│   │   ├── gamma_tables.c    # affects images, which look ok
│   │   ├── histograms.c    # displayed
│   │   ├── imu_log393.c
│   │   ├── jpeghead.c
│   │   ├── klogger_393.c
│   │   ├── lepton.c
│   │   ├── mt9f002.c
│   │   ├── mt9x001.c    # sensor is programmed correctly
│   │   ├── multi10359.c
│   │   ├── pgm_functions.c    # parameters are applied correctly (mt9p006)
│   │   ├── quantization_tables.c    # images not broken
│   │   ├── sensor_common.c
│   │   ├── sensor_i2c.c
│   │   ├── x393.c
│   │   ├── x393_fpga_functions.c    # ok
│   │   └── x393_videomem.c    # also used in circbuf => recording => works
│   ├── misc
│   │   ├── ltc3589.c
│   │   └── vsc330x.c    # switching between internal and external SSD ports works
│   ├── mmc
│   │   └── host
│   │       └── sdhci.c    # this needed chip detect ORed with dat3: SDHCI_ANY_PRESENT = SDHCI_CARD_PRESENT | SDHCI_DAT3_PRESENT
│   ├── mtd
│   │   └── nand    # added functions to work with OTP, tested only reading
│   │       ├── nand_base.c
│   │       ├── nandchip-micron.c
│   │       └── pl35x_nand.c
│   ├── net
│   │   └── ethernet
│   │       └── cadence
│   │           └── macb_main.c    # needed fixup for Atheros chip - disable SmartEEE
│   └── rtc
│       └── rtc-m41t80.c    # updated to latest version. Our changes only ignore Oscillator failure at boot in m41t80_get_datetime().
├── helpers
│   └── si5338_register_map_dts.py    # test it?
├── other
│   └── mem.py
└── patches
    ├── ahci.patch
    ├── drivers-elphel.patch
    ├── garmin_usb.c.patch
    └── libahci.patch
[SOLVED] Note 1: Bring back fpga char device
- /dev/xdevcfg is obsolete: the FPGA manager replaces it, but it cannot load *.bit files (only *.bin or *.bit.bin)
- Solution:
Brought back the old driver (drivers/char/xilinx_devcfg.c, plus edits to Kconfig and Makefile). It works as it used to.
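With the character device back, loading a bitstream is just a write of the *.bit file to the device node. A minimal sketch, assuming the node is /dev/xdevcfg (inferred from the driver name) and using a hypothetical helper name and bitstream path:

```python
def load_bitstream(bit_path, dev_path="/dev/xdevcfg", chunk_size=64 * 1024):
    """Stream a bitstream file into the devcfg character device.

    dev_path is an assumption based on the driver name; adjust if the
    node is named differently on the target. Returns bytes written.
    """
    written = 0
    with open(bit_path, "rb") as src, open(dev_path, "wb") as dst:
        while True:
            chunk = src.read(chunk_size)
            if not chunk:
                break
            dst.write(chunk)
            written += len(chunk)
    return written
```

Usage on the camera would look like load_bitstream("/path/to/some.bit"), where the path is a placeholder for wherever the bitstream is stored.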
[SOLVED] Note 2: Build php 5.6.40
- php 5.6.40 is EOL and won't build: mysql has apparently moved its header files.
- Solution:
Disabled the mysql extension. Added to meta-elphel393/recipes-devtools/php/php_5.6.%.bbappend:
PACKAGECONFIG[mysql] = "--without-mysql --without-mysqli --without-pdo-mysql"
CFLAGS += " -ldl"
[SOLVED] Note 3: Entropy device hwrng
- The new package rng-tools complains: Failed to init entropy source hwrng
- Solution:
Leave as is for now. The full log is:
Initalizing available sources
Failed to init entropy source hwrng
Enabling JITTER rng support
Initalizing entropy source jitter
- Comments:
- Haven't found out whether Xilinx provides any driver for /dev/hwrng
- TODO: Find out if the order of entropy sources can be changed
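One way to check whether the kernel has any hardware RNG driver registered is to read the hw_random sysfs attributes. A small sketch, assuming the usual /sys/class/misc/hw_random location (the helper name is made up for illustration):

```python
from pathlib import Path

def hwrng_status(sysfs_dir="/sys/class/misc/hw_random"):
    """Report which hardware RNGs the kernel knows about.

    Returns a dict with the list of available RNG names and the current one.
    Missing or unreadable attributes are reported as empty values.
    """
    base = Path(sysfs_dir)

    def read_attr(name):
        try:
            return (base / name).read_text().strip()
        except OSError:
            return ""

    return {
        "available": read_attr("rng_available").split(),
        "current": read_attr("rng_current"),
    }
```

An empty "available" list would suggest that no hwrng driver is built in or loaded, which would be consistent with the rng-tools message above.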
- Note 4: PHP causing 'unsupported FP instruction in kernel mode'
- autocampars.php runs at boot and sometimes causes Kernel Oops:
[ 35.872118] BUG: unsupported FP instruction in kernel mode
[ 35.877621] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP ARM
[ 35.883380] Modules linked in:
[ 35.886498] CPU: 1 PID: 1756 Comm: php Not tainted 4.14.0-xilinx-v2018.3 #1
[ 35.893459] Hardware name: Xilinx Zynq Platform
[ 35.897989] task: ee83f280 task.stack: ef1d6000
[ 35.902527] PC is at vfp_reload_hw+0x30/0x44
[ 35.906802] LR is at __und_usr_fault_32+0x0/0x8
[ 35.911338] pc : [<c0102e10>] lr : [<c010c280>] psr: a0000013
[ 35.917529] sp : ef1d7fb0 ip : 00000051 fp : 00000001
[ 35.922813] r10: ef1d61f8 r9 : c010c308 r8 : ee9893c0
[ 35.928040] r7 : 00000001 r6 : 00400100 r5 : c0138d08 r4 : ecd600f8
[ 35.934569] r3 : c0c6c064 r2 : b67bde8c r1 : ecd9a224 r0 : eeb00a40
[ 35.941098] Flags: NzCv IRQs on FIQs on Mode SVC_32 ISA ARM Segment none
[ 35.948241] Control: 18c5387d Table: 2cda404a DAC: 00000051
[ 35.953993] Process php (pid: 1756, stack limit = 0xef1d6210)
[ 35.959740] Stack: (0xef1d7fb0 to 0xef1d8000)
[ 35.964020] 7fa0: a5f43f50 a5f43e18 00000080 00000000
[ 35.972269] 7fc0: 00000000 a5f43f4c b687b338 000000ae 00000000 bedcdfe4 00000001 a5f43ffc
[ 35.980385] 7fe0: a5f43f50 a5f43d7c b676cf78 b67bde8c 60000010 ffffffff 00000000 00000000
[ 35.988626] Code: 128aa080 e89a0162 e3110102 0a000003 (eee96a10)
[ 35.994724] ---[ end trace 06029778db6d2d90 ]---
[ 35.999422] note: php[1756] exited with preempt_count 2
Unsupported floating point instruction in kernel?
- Is it hardware (a faulty board? temperature dependent?), the kernel, or php?
- solution?:
Took arch/arm/vfp/vfpmodule.c from kernel 4.19 (the current kernel is 4.14). It didn't work. Roll back and check which php call caused it?
- TODO: keep an eye on this, because the root cause has not been investigated
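As a possible starting point for that investigation: the word in parentheses on the Code: line of the Oops is the instruction at the faulting PC. The sketch below decodes it under the assumption that it is an ARM-mode VMSR (write to a VFP system register); the bit layout and register numbers are taken from the ARM architecture encoding as I understand it, and the helper name is made up.

```python
# VFP system register numbers used by VMRS/VMSR (assumed from the ARM ARM)
VFP_SYSREGS = {
    0b0000: "FPSID",
    0b0001: "FPSCR",
    0b0110: "MVFR1",
    0b0111: "MVFR0",
    0b1000: "FPEXC",
    0b1001: "FPINST",
    0b1010: "FPINST2",
}

def decode_vmsr(word):
    """Return (sysreg, Rt) if word looks like an ARM-mode VMSR, else None.

    Assumed layout: cond[31:28] 1110_1110[27:20] reg[19:16] Rt[15:12] 1010_0001_0000[11:0]
    """
    if (word >> 20) & 0xFF != 0xEE or word & 0xFFF != 0xA10:
        return None
    reg = (word >> 16) & 0xF
    rt = (word >> 12) & 0xF
    return VFP_SYSREGS.get(reg, "reg%d" % reg), "r%d" % rt
```

For the word from the log, decode_vmsr(0xeee96a10) gives ("FPINST", "r6"). If this decoding is right, the fault is a write to FPINST inside vfp_reload_hw, which might point at the saved VFP state being restored rather than at a particular php call. This is only a lead, not a confirmed diagnosis.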
[SOLVED] Note 5: Bring up NAND OTP support
- MAC is not read from NAND, displays the default: 00:0e:64:10:00:00
- Problem?
[ 3.639851] elphel393-init: Flash page read, code -95
- Comments:
- Look up what has changed.
- Solution: (for xlnx_rebase_v4.14 branch of linux-xlnx):
In drivers/mtd/nand/nand_base.c, nand_scan_tail() calls nand_manufacturer_init(), which is mapped to a new driver, drivers/mtd/nand/nand_micron.c. When that call fails, the driver init fails and the mtd functions do not get assigned. The driver that reads from the OTP area (drivers/elphel/elphel393-init.c) then returns -95, which is EOPNOTSUPP. For a quick fix we just need to fall through.
The function exits with an error because it decides that it does not support forcefully enabled on-die ECC. This still needs to be investigated.
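The failure chain above can be pictured with a toy model. This is not kernel code: the class, the function names and the placeholder error code returned by the manufacturer init are all hypothetical, and only the -EOPNOTSUPP result for an unassigned OTP hook mirrors what the boot log shows.

```python
import errno

class Mtd:
    """Toy stand-in for the mtd device; only the OTP read hook is modelled."""
    def __init__(self):
        self.read_otp_hook = None  # assigned only if the scan completes

def manufacturer_init():
    # Stands in for the Micron-specific init rejecting on-die ECC.
    # The real error code is not given in the notes; EINVAL is a placeholder.
    return -errno.EINVAL

def scan_tail(mtd, fall_through):
    ret = manufacturer_init()
    if ret and not fall_through:
        return ret  # original behaviour: abort, hooks never get assigned
    # quick fix: ignore the failure and finish the setup
    mtd.read_otp_hook = lambda offset, length: bytes(length)  # dummy data
    return 0

def read_otp(mtd, offset, length):
    if mtd.read_otp_hook is None:
        return -errno.EOPNOTSUPP  # shows up as "code -95" on Linux
    return mtd.read_otp_hook(offset, length)
```

With fall_through=False the read returns -EOPNOTSUPP and the MAC falls back to the default; with fall_through=True the read reaches the (here dummy) OTP data.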
[SOLVED] Note 6: udev - unknown group 'kvm'
- Problem:
[ 5.817352] udevd[1478]: starting version 3.2.7
[ 5.918028] udevd[1478]: specified group 'kvm' unknown
[ 5.986364] udevd[1479]: starting eudev-3.2.7
[ 6.142897] udevd[1479]: specified group 'kvm' unknown
- Solution:
KVM == Kernel-based Virtual Machine. Removed the 'kvm' group rule for now (and maybe forever) via a bbappend:
.
└── udev
    ├── eudev
    │   └── 50-udev-default.rules
    └── eudev_3.2.7.bbappend
50-udev-default.rules gets installed over the original file.
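Producing the replacement rules file amounts to dropping every rule that assigns the unknown group. A sketch of that filtering step, assuming the edit was a plain line removal (the notes do not say exactly which lines were changed) and using a hypothetical helper name:

```python
import re

def drop_group_rules(rules_text, group):
    """Remove udev rule lines that assign GROUP to the given group name.

    Comment lines are kept as they are. Only assignments (=, +=, :=) are
    matched, not == comparisons.
    """
    pattern = re.compile(r'\bGROUP\s*(?::=|\+=|=)\s*"%s"' % re.escape(group))
    kept = []
    for line in rules_text.splitlines(keepends=True):
        if line.lstrip().startswith("#") or not pattern.search(line):
            kept.append(line)
    return "".join(kept)
```

Running it with group "kvm" over the stock 50-udev-default.rules and installing the result from the bbappend should silence the 'specified group unknown' message.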
[SOLVED] Note 7: Add back fixup for Atheros to updated ethernet driver
- Problem:
The ethernet driver's structure has changed: it was split into several files. It lives at drivers/net/ethernet/cadence/
- Solution:
For our ethernet chip (Atheros 80xx) a fixup had to be added to disable SmartEEE. It's a single function, a call to it, and a couple of defines. All of it was added back to the new driver structure.
[SOLVED] Note 8: u-boot update
- update u-boot
- solution:
Updated to mainline u-boot 2019.07:
- converted our *.h (with params used to generate the SPL header) to Kconfigs
- updated the driver for NAND flash
- tested both boot modes: mmc and nand
[SOLVED] Note 9: test camogm
- test camogm
/var/state/camogm_cmd accepts only the first write. Switch to polling?
With polling, the buffer overflows during recording, probably because the polling version does not work correctly.
Everything works in the version without polling after adding an EOF reset (clearerr(npipe)) right after reading from the pipe and checking feof().
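A likely explanation (not confirmed in these notes) is that newer glibc treats end-of-file on a stdio stream as sticky: once the first writer closes the pipe, every later read reports EOF until clearerr() is called. At the file descriptor level the FIFO itself keeps working across writers, which the sketch below illustrates. camogm is C and uses clearerr(); Python is used here only to show the pipe behaviour, and the function names are made up.

```python
import os

def open_cmd_pipe(path):
    """Open the command FIFO for reading without blocking on a missing writer."""
    return os.open(path, os.O_RDONLY | os.O_NONBLOCK)

def read_commands(fd, max_bytes=4096):
    """Return pending command text, or "" when nothing is available.

    An empty read means EOF (the last writer closed the pipe). At this
    level there is no EOF flag to reset: the next writer's data is
    simply returned by a later call.
    """
    try:
        data = os.read(fd, max_bytes)
    except BlockingIOError:
        # A writer is connected but has not written anything yet.
        return ""
    return data.decode()
```

Each command sent as a separate open/write/close on the FIFO is then picked up by a later read_commands() call, which is the behaviour the clearerr() fix restores for the stdio-based reader.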
[SOLVED] Note 10: test streamer
- test streamer
Streamer works
[SOLVED] Note 11: test AHCI driver
- test ahci driver
- results:
- SSD is detected and automounted
- write/read works
- Note 12: test raw recording
- test recording on a raw partition