Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Pappy's Kernel Seeds Part IV
View unanswered posts
View posts from last 24 hours

Goto page Previous  1, 2, 3 ... 11, 12, 13 ... 18, 19, 20  Next  
This topic is locked: you cannot edit posts or make replies.    Gentoo Forums Forum Index Unsupported Software
View previous topic :: View next topic  
Author Message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Wed Apr 18, 2012 6:06 am    Post subject: Reply with quote

Moriah,

Yes, ACPI was turned on. I'm curious as to what that might be. Perhaps play with the individual parts of ACPI until you find the true offender, or turn on MSI to see if it is now willing to work.

I wish I could get my hands on that mobo to see what its general dysfunction is.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.


Last edited by pappy_mcfae on Wed Apr 18, 2012 6:08 am; edited 1 time in total
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Wed Apr 18, 2012 6:07 am    Post subject: Reply with quote

Another day, and lots of hardened sources, plus a gentoo. I've just uploaded .configs for 2.6.32-hardened-r100, linux-3.2.15-hardened, 3.3.2-gentoo, and 3.3.2-hardened in both x86 and x86_64 flavors. Enjoy!

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Thu Apr 19, 2012 6:35 pm    Post subject: Reply with quote

Pappy: I just had a crash using the config you put together, but with ACPI turned off. The dmesg is at:

http://www.elilabs.com/~rj/ezekiel.dmesg

Please take a look at it, if for nothing more than the mobo model number. :(
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 43577
Location: 56N 3W

PostPosted: Thu Apr 19, 2012 7:01 pm    Post subject: Reply with quote

Moriah,

Forbidden

You don't have permission to access /~rj/ezekiel.dmesg on this server.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Thu Apr 19, 2012 8:03 pm    Post subject: Reply with quote

Same here.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Thu Apr 19, 2012 8:17 pm    Post subject: Reply with quote

Oops! Sorry, forgot to change the perms after I put it there. Now you should be able to fetch it. :oops:
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 43577
Location: 56N 3W

PostPosted: Thu Apr 19, 2012 8:31 pm    Post subject: Reply with quote

Moriah,

Whats this almost 60 sec wait for?
Code:
[    6.527470] md0: detected capacity change from 0 to 1500301819904
[    6.527654] mdadm used greatest stack depth: 5104 bytes left
[   21.573720]  md0: unknown partition table
[   62.139335] UDF-fs: warning (device dm-1): udf_fill_super: No partition found (1)

It suggests thats something is not happy

You have some microcode missing
Code:
[   75.444803] r600_cp: Failed to load firmware "radeon/R600_rlc.bin"
[   75.444816] [drm:r600_startup] *ERROR* Failed to load firmware


Code:
[   75.355925] r8169 Gigabit Ethernet driver
what hardware do you have on the end of that driver?
I have a card that needs a firmware patch, so it doesn't stall the boot for 60 sec.

Thats all I see in your dmesg. There is nothing relating to permanent stalls
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Thu Apr 19, 2012 10:00 pm    Post subject: Reply with quote

Neddy: I posted this a few days ago, but since it wound up on another page, I am reposting it here:

The lspci -n output is:
Code:

00:00.0 0600: 1022:9600
00:01.0 0604: 1022:9602
00:0a.0 0604: 1022:9609
00:11.0 0106: 1002:4391
00:12.0 0c03: 1002:4397
00:12.1 0c03: 1002:4398
00:12.2 0c03: 1002:4396
00:13.0 0c03: 1002:4397
00:13.1 0c03: 1002:4398
00:13.2 0c03: 1002:4396
00:14.0 0c05: 1002:4385 (rev 3a)
00:14.1 0101: 1002:439c
00:14.3 0601: 1002:439d
00:14.4 0604: 1002:4384
00:14.5 0c03: 1002:4399
00:18.0 0600: 1022:1200
00:18.1 0600: 1022:1201
00:18.2 0600: 1022:1202
00:18.3 0600: 1022:1203
00:18.4 0600: 1022:1204
01:05.0 0300: 1002:9610
01:05.1 0403: 1002:960f
02:00.0 0200: 10ec:8168 (rev 02)
03:06.0 0200: 10ec:8139 (rev 10)
03:07.0 0200: 10ec:8139 (rev 10)
03:0e.0 0c00: 104c:8024


I hope that is sufficient to andswer any questions about the hardware.
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Fri Apr 20, 2012 4:50 am    Post subject: Reply with quote

From my experience, UDF issues are related to optical drives. There could also be an issue in the fact that the hard drives are setup in a different way, to whit:

Quote:
I just completely rebuilt the software for it using your 3.2.1-gentoo-r2-x86_64-08.config and the appropriate kernel release. This box has a 4-core AMD cpu and 8 GB of RAM, along with 3 1.5 TB SATA drives in a hot-swap SATA backplane running Linux software RAID-1 as a 3-way mirror on /dev/md0. /dev/md0 is then encrypted with LUKS and then is made a physical volume under LVM2. The PV has a single volume group on it, and that VG has a single logical volume on it, which is the root filesystem under XFS. The whole mess is booted with an initramfs from USB flash stick as the boot device.


This has been a delicate machine in the past, as I recall. Getting it right has required a bit if trial and error, and it seems likely that we're there again.

As I'm typing this, I am wondering if the initramfs has been redone with every kernel change. If not, that could be a part of the issues. There is also the /usr, udev, and initramfs issue wherein you have to place the entirety of /usr into the initramfs if it's on a separate partition. I'm wondering if this might also hold true for any time that initramfs is required. Admittedly, that's a straw clutch, but it might help.

Where I'd go from here would be to shut off UDF and see if that changes the delay. As long as Moriah isn't going to use the optical drive, it shouldn't be an issue.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Fri Apr 20, 2012 5:09 am    Post subject: Reply with quote

I only use the CD/DVD drive for booting from a liveCD for maintenence work. Since I usually administer this machine remotely, I keep a liveCD in the CD drive at all times, and select it as the boot device from the BIOS when I need to; other times I leave the USB drive as the boot drive and let it boot my convoluted way so I can run the root filesystem as an LVM snapshottable device on the encrypted 3-way RAID-0 mirror.

But to work on this weird kernel problem, I'll make a concession: I will reconfigure the machine to boot from a single drive with no raid and no encryption an no lvm. That single drive will have 2 partitions: a 50 MB boot partition, and a big root filesystem partition. That configuration will let us get to the heart of the kernel problem without the confusion of all that other fancy stuff. Once we get a stable kernel, then I can reconfigure with all that fancy stuff.

I am convinced that even that simple configuration will demonstrate the same problems, as I believe that it is kernel and driver related. Firmware and bios configuration may have something to do with it too, but I do not believe the raid, crypto, lvm, and usb-stick boot device has a thing to do with it.

BTW Pappy, this machine has *NEVER* had a properly stable linux kernel running on it, other than when it boots from a liveCD. I am running another, newer, gigabyte motherboard with an amd f8 cpu chip (8-core) in another machine, and I have had zero trouble with it. Once we get this problem child working, I will let you take a look at that machine's config, dmessg, etc., as you can probably tweak it a bit and make it even better. :wink:
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Fri Apr 20, 2012 5:18 am    Post subject: Reply with quote

Neddy: I think that wait is deliberate. I put sleeps in my init script in my initramfs so I can monitor the boot process visually. This is mainly to assist me in debugging the initramfs, etc., but since I do not boot very often, I leave the sleeps in the script even after I get it working. If anything goes haywire later, this helps me see what is going on.

Here is my init script:
Code:

#!/bin/busybox sh

echo Mounting /proc
mount -t proc none /proc
sleep 5

echo Starting raid arrays
mdadm -A -f /dev/md0 /dev/sda /dev/sdb /dev/sdc
sleep 5

echo Mounting /sys
mount -t sysfs none /sys
sleep 5

echo Creating devices
echo /sbin/mdev > /proc/sys/kernel/hotplug
mdev -s
sleep 5

echo Doing luksOpen
cryptsetup luksOpen /dev/md0 cryptoraid
sleep 5

echo Doing vgscan
lvm vgscan --mknodes
sleep 5

echo Doing vgchange
lvm vgchange -a y
sleep 5

echo Mounting root
mount /dev/mapper/gentoo-rootfs /mnt/root
sleep 5

echo Unmounting /proc and /sys
umount /proc
umount /sys
sleep 5

set +vx

echo Doing the pivot root
exec switch_root /mnt/root /sbin/init

echo OOPS!
sleep 30


No, wait! On second glance, those sleeps are only 5 seconds each. 8O

YIKES! I *DO* have a problem there! :evil:
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Fri Apr 20, 2012 3:47 pm    Post subject: Reply with quote

I'm off to pick up a 1 GB 7200 rpm sata drive. I need it for a spare in an array on another machine anyway, but I will first use it to build a test system for the NEMESIS MACHINE ! :twisted:

This will be a simple straightforward "normal" configuration, as described 2 posts earleir. We can fiddle with this until we get it working properly, then I can prot what we leard to the real setup. :D

I will post again after I have a simple system built on this new drive, and at least booting on the NEMESIS MACHINE !
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Fri Apr 20, 2012 4:49 pm    Post subject: Reply with quote

Yes. This machine has been anything but cooperative. I will be very interested in how it works with a more "normal" setup.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 43577
Location: 56N 3W

PostPosted: Fri Apr 20, 2012 5:40 pm    Post subject: Reply with quote

Moriah.

Code:
02:00.0 0200: 10ec:8168 (rev 02)
is your 1G network card.

Maybe you need W: Possible missing firmware /lib/firmware/rtl_nic/rtl8168d-2.fw for module r8169 W: Possible missing firmware /lib/firmware/rtl_nic/rtl8168d-1.fw for module r8169
See http://www.spinics.net/lists/netdev/msg161236.html

As I say I netboot a box with a card from this family and it sulks for 60 sec looking for firmware.
It makes no apparent difference to the card operation but I wonder if lack of the firmware causes odd behavior?

I have that AMD chipset. My motherboard provides USB Transaction Translation ... do you feel lucky?
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Fri Apr 20, 2012 8:06 pm    Post subject: Reply with quote

In an attempt to perhaps clear a few things, boot off your boot CD, and post the locally created /var/log/dmesg. That should give a good idea of what is and isn't starting. Since the boot CD allows the machine to run in a more stable fashion, that might give an idea of what is and isn't loading, and when.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Fri Apr 20, 2012 11:51 pm    Post subject: Reply with quote

I got the 1 TB drive today. As per my usual policy, I do not use a new drive until I have run badblocks -wv /dev/sd_ on it to make sure I did not start with a problem. I am running badblocks on the new drive right now. It is running from a liveCD booted on the problem child. As soon as it finished (sometime tomorrow) I will post the liveCD dmesg.
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Sat Apr 21, 2012 1:29 am    Post subject: Reply with quote

Okay. It should prove interesting to see what the live CD has to say about what's going on with the machine.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Sat Apr 21, 2012 9:43 pm    Post subject: Reply with quote

OK, the dmesg from the livecd boot is here:

http://www.elilabs.com/~rj/livecd.dmesg

Hopefully this will explain a few things. :twisted:
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Sat Apr 21, 2012 10:55 pm    Post subject: Reply with quote

OK, I built as plain of a vanilla system as I could using Pappy's config that he linked to a few days back. The only think I did was to disable ACPI, then I built it on a 1TB sata drive with a 50M boot partition on /dev/sda1 and the rest of the drive on /dev/sda2. The /dev/sda1 is ext2, and the /dev/sda2 is ext3. I had to do the usual fiddle with udev to get the nics where I wanted them. Here is the dmesg:

http://www.elilabs.com/~rj/ezekiel.dmesg
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Sun Apr 22, 2012 5:06 pm    Post subject: Reply with quote

BTW Pappy, although I know that the way you configured the video frame buffer in the config you gave me shows more lines, and this makes it easier to see what happened when the system doesn't boot, once we get this thing working well enough, I want to switch back to the old frame buffer. The reason for this is that my ipeps (see http://www.adder.com/products/adderlink-ipeps for details) works better (has faster response) with the smaller frame buffer.
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Sun Apr 22, 2012 6:07 pm    Post subject: Reply with quote

Sorry I haven't gotten to your stuff yet. My car is in a state of probable death, and as you can understand, that's been occupying my thoughts. I'm going to give it the shakedown today, and see if I'm right, or just paranoid. I'll take a look at things later on this evening.

Wish me luck.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Mon Apr 23, 2012 5:18 am    Post subject: Reply with quote

My car is in need of a complete valve job, or head replacement. The timing belt slipped some teeth...and with an interference engine, that's never a good thing.

I have your dmesg posts, and I'll do some looking tonight before I go to bed. It's going to be interesting.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
NeddySeagoon
Administrator
Administrator


Joined: 05 Jul 2003
Posts: 43577
Location: 56N 3W

PostPosted: Mon Apr 23, 2012 6:07 pm    Post subject: Reply with quote

pappy_mcfae,

I hope it is only the top end. I've known holes in pistons, which is far more serious.
_________________
Regards,

NeddySeagoon

Computer users fall into two groups:-
those that do backups
those that have never had a hard drive fail.
Back to top
View user's profile Send private message
Moriah
Advocate
Advocate


Joined: 27 Mar 2004
Posts: 2117
Location: Kentucky

PostPosted: Mon Apr 23, 2012 10:25 pm    Post subject: Reply with quote

Been there; done that -- many years ago with the holes in pistons routine. Total engine rebuild time, which is worth it if the car is a goodie, but not if the whole car is a total rust bucket. Being in Texas, I would imagine the rust is not too bad. The other option is an engine swap, but if its a used junk yard engine, then you never know what you are getting. Maybe its time to donate that car to your favorite charity and get another one. :?
_________________
The MyWord KJV Bible tool is at http://www.elilabs.com/~myword

Foghorn Leghorn is a Warner Bros. cartoon character.
Back to top
View user's profile Send private message
pappy_mcfae
Watchman
Watchman


Joined: 27 Dec 2007
Posts: 5999
Location: Pomona, California.

PostPosted: Mon Apr 23, 2012 11:34 pm    Post subject: Reply with quote

I will definitely know more when I take off the head tomorrow. I got the car numerous years ago when the original owner, get this, broke the timing belt while turning left. At that time, it took out four intake and four exhaust valves. At that time, there was no damage to the pistons, as the valves have very thin stems, and seem to be made to take the abuse, leaving the pistons unharmed.

This time, I'm thinking all sixteen valves have bent. I'm fairly sure that the pistons survived this time as well since the belt slipped while cranking. Even if I do lose a piston or two, I have lots of experience in doing in-car overhauls.

I had another Mitsubishi that had a bad piston. While I really should have replaced the motor in that car, it was cheap, as was the labor. Since I was working in a shop at the time, I had the benefit of a lift, and power tools. I don't have those advantages, but I do know how to gut motors while they're still in the car. I must say that it's wonderful to have the knowledge of things that run on gas, diesel, or electricity.

Wish me luck!

Moriah,

I'll be checking out the dmesg posts after I get done with my emails. They've been collecting for the last few days.

Cheers,
Pappy
_________________
This space left intentionally blank, except for these ASCII symbols.
Back to top
View user's profile Send private message
Display posts from previous:   
This topic is locked: you cannot edit posts or make replies.    Gentoo Forums Forum Index Unsupported Software All times are GMT
Goto page Previous  1, 2, 3 ... 11, 12, 13 ... 18, 19, 20  Next
Page 12 of 20

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum