Intel Ethernet bug #12

Closed
opened 2021-01-05 12:02:39 +08:00 by sb10q · 3 comments

The Internet connection to nixbld failed after ~160 days because of its Intel Ethernet adapter. The sequence logged below repeated in a loop until the machine was rebooted.

Jan 05 10:52:03 nixbld dhcpcd[10349]: enp0s31f6: carrier acquired
Jan 05 10:52:03 nixbld kernel: e1000e: enp0s31f6 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
Jan 05 10:52:03 nixbld dhcpcd[10349]: enp0s31f6: IAID bc:12:91:70
Jan 05 10:52:03 nixbld dhcpcd[10349]: enp0s31f6: adding address fe80::21f:bcff:fe12:9170
Jan 05 10:52:03 nixbld dhcpcd[10349]: enp0s31f6: soliciting a DHCP lease
Jan 05 10:52:04 nixbld dhcpcd[10349]: enp0s31f6: soliciting an IPv6 router
Jan 05 10:52:05 nixbld kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                                 TDH                  <0>
                                 TDT                  <b>
                                 next_to_use          <b>
                                 next_to_clean        <0>
                               buffer_info[next_to_clean]:
                                 time_stamp           <446bebee3>
                                 next_to_watch        <0>
                                 jiffies              <446bec6c1>
                                 next_to_watch.status <0>
                               MAC Status             <40080083>
                               PHY Status             <796d>
                               PHY 1000BASE-T Status  <3800>
                               PHY Extended Status    <3000>
Jan 05 10:52:07 nixbld kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                                 TDH                  <0>
                                 TDT                  <b>
                                 next_to_use          <b>
                                 next_to_clean        <0>
                               buffer_info[next_to_clean]:
                                 time_stamp           <446bebee3>
                                 next_to_watch        <0>
                                 jiffies              <446bece80>
                                 next_to_watch.status <0>
                               MAC Status             <40080083>
                               PHY Status             <796d>
                               PHY 1000BASE-T Status  <3800>
                               PHY Extended Status    <3000>
                               PCI Status             <10>
                            PCI Status             <10>
Jan 05 10:52:09 nixbld kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                                 TDH                  <0>
                                 TDT                  <b>
                                 next_to_use          <b>
                                 next_to_clean        <0>
                               buffer_info[next_to_clean]:
                                 time_stamp           <446bebee3>
                                 next_to_watch        <0>
                                 jiffies              <446bed640>
                                 next_to_watch.status <0>
                               MAC Status             <40080083>
                               PHY Status             <796d>
                               PHY 1000BASE-T Status  <3800>
                               PHY Extended Status    <3000>
                               PCI Status             <10>
Jan 05 10:52:11 nixbld kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                                 TDH                  <0>
                                 TDT                  <b>
                                 next_to_use          <b>
                                 next_to_clean        <0>
                               buffer_info[next_to_clean]:
                                 time_stamp           <446bebee3>
                                 next_to_watch        <0>
                                 jiffies              <446bede40>
                                 next_to_watch.status <0>
                               MAC Status             <40080083>
                               PHY Status             <796d>
                               PHY 1000BASE-T Status  <3800>
                               PHY Extended Status    <3000>
                               PCI Status             <10>
Jan 05 10:52:13 nixbld kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                                 TDH                  <0>
                                 TDT                  <b>
                                 next_to_use          <b>
                                 next_to_clean        <0>
                               buffer_info[next_to_clean]:
                                 time_stamp           <446bebee3>
                                 next_to_watch        <0>
                                 jiffies              <446bee600>
                                 next_to_watch.status <0>
                               MAC Status             <40080083>
                               PHY Status             <796d>
                               PHY 1000BASE-T Status  <3800>
                               PHY Extended Status    <3000>
                               PCI Status             <10>
Jan 05 10:52:13 nixbld dhcpcd[10349]: enp0s31f6: using IPv4LL address 169.254.216.39
Jan 05 10:52:13 nixbld dhcpcd[10349]: enp0s31f6: adding route to 169.254.0.0/16
Jan 05 10:52:13 nixbld dhcpcd[10349]: enp0s31f6: adding default route
Jan 05 10:52:14 nixbld dhcpcd[10349]: enp0s31f6: carrier lost
Jan 05 10:52:14 nixbld kernel: e1000e 0000:00:1f.6 enp0s31f6: Reset adapter unexpectedly
Jan 05 10:52:14 nixbld dhcpcd[10349]: enp0s31f6: deleting address fe80::21f:bcff:fe12:9170
Jan 05 10:52:14 nixbld dhcpcd[10349]: enp0s31f6: deleting route to 169.254.0.0/16
Jan 05 10:52:14 nixbld dhcpcd[10349]: enp0s31f6: deleting default route

Related: https://superuser.com/questions/1270723/how-to-fix-eth0-detected-hardware-unit-hang-in-debian-9
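
For anyone comparing dumps, here is a rough Python sketch that pulls the hex fields out of one "Detected Hardware Unit Hang" block and derives two numbers from it. It assumes the usual e1000e meaning of the fields: TDH/TDT are the transmit descriptor head and tail, and time_stamp is the jiffies value recorded when the stuck buffer was queued. The names `parse_hang_dump` and `summarize` are made up for this example. Converting jiffies to seconds would need the kernel's HZ setting, which is not in the log, so the result is left in jiffies.

```python
import re

# One field per line: a name, whitespace, then a hex value in angle brackets.
# Lines without a "<hex>" value (the syslog header, "buffer_info[...]:") are skipped.
FIELD_RE = re.compile(r"^[ \t]*(\S.*?)[ \t]+<([0-9a-fA-F]+)>[ \t]*$", re.MULTILINE)


def parse_hang_dump(text):
    """Return {field name: int} for every 'name <hex>' line in a dump."""
    return {name: int(value, 16) for name, value in FIELD_RE.findall(text)}


def summarize(fields):
    """Derive two numbers that describe the stall (hypothetical helper)."""
    return {
        # Descriptors queued by the driver (TDT) that the NIC has not
        # consumed yet (TDH). Ring wraparound is not handled here.
        "pending_descriptors": fields["TDT"] - fields["TDH"],
        # How long the oldest buffer has been waiting, in jiffies.
        "stalled_jiffies": fields["jiffies"] - fields["time_stamp"],
    }


# First dump from the log above, copied verbatim.
SAMPLE = """\
Jan 05 10:52:05 nixbld kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                                 TDH                  <0>
                                 TDT                  <b>
                                 next_to_use          <b>
                                 next_to_clean        <0>
                               buffer_info[next_to_clean]:
                                 time_stamp           <446bebee3>
                                 next_to_watch        <0>
                                 jiffies              <446bec6c1>
                                 next_to_watch.status <0>
                               MAC Status             <40080083>
                               PHY Status             <796d>
                               PHY 1000BASE-T Status  <3800>
                               PHY Extended Status    <3000>
"""

if __name__ == "__main__":
    print(summarize(parse_hang_dump(SAMPLE)))
```

For the first dump this gives 11 pending descriptors and a stall of 2014 jiffies. TDH and time_stamp are identical in all five dumps while jiffies keeps increasing, so the transmit ring does not appear to advance at all before the adapter reset.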

Poster
Owner

The adapter is an "Intel Corporation Ethernet Connection (2) I219-V [8086:15b8]".

Poster
Owner

This is happening more frequently and becoming a serious annoyance. Will probably switch to the new server soon.

Poster
Owner

Done.
PostgreSQL is being a royal pain (https://stackoverflow.com/questions/20427689/psql-invalid-command-n-while-restore-sql/20428547) as usual and breaking Mattermost; the rest is working nicely.
sb10q closed this issue 2021-08-07 10:40:41 +08:00
Reference: M-Labs/it-infra#12