Bubba hangs

Gunnarsson
Posts: 31
Joined: 26 Mar 2009, 07:15

Post by Gunnarsson »

Hi

I've just got my bubbaTWO 1TB, but when I try to transfer my music it hangs after 5 minutes or so. After that I can't access it in any way, although it still responds to ping.
So I have to pull the plug on it and restart. I have the latest firmware.

I don't know what to do... :cry:

The last lines in Syslog before the hang:

Code: Select all

Apr 1 21:22:47 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
Apr 1 21:22:48 Nexus ntpd[2533]: kernel time sync enabled 4001
Apr 1 21:24:14 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
Apr 1 21:25:38 Nexus /USR/SBIN/CRON[3256]: (root) CMD (test -x /usr/bin/php && /usr/bin/php /usr/share/horde3/scripts/alarms.php)
Apr 1 21:26:56 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
Apr 1 21:28:36 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
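
A minimal sketch for anyone who wants to see how often the "DMA halt timeout!" message shows up in a syslog excerpt like the one above. The function names are made up for illustration, and the year is an assumption (syslog lines carry no year; 2009 is taken from the Samba log dates in this post).

```python
from datetime import datetime

# Sample lines copied from the syslog excerpt above.
SYSLOG = """\
Apr 1 21:22:47 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
Apr 1 21:22:48 Nexus ntpd[2533]: kernel time sync enabled 4001
Apr 1 21:24:14 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
Apr 1 21:26:56 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
Apr 1 21:28:36 Nexus kernel: of-fsl-dma-channel e0008100.dma-channe: DMA halt timeout!
"""


def dma_timeouts(text, year=2009):
    """Return the timestamps of all 'DMA halt timeout!' lines.

    The year is an assumption, since syslog does not record it.
    """
    stamps = []
    for line in text.splitlines():
        if "DMA halt timeout!" not in line:
            continue
        month, day, clock = line.split()[:3]
        stamps.append(
            datetime.strptime(f"{year} {month} {day} {clock}", "%Y %b %d %H:%M:%S")
        )
    return stamps


def gaps_seconds(stamps):
    """Seconds elapsed between consecutive timestamps."""
    return [int((b - a).total_seconds()) for a, b in zip(stamps, stamps[1:])]


if __name__ == "__main__":
    stamps = dma_timeouts(SYSLOG)
    print(len(stamps), "DMA halt timeouts, gaps in seconds:", gaps_seconds(stamps))
```

On the excerpt above, the timeouts come one to three minutes apart, which may help when comparing against the time a transfer was started.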

# Top Idle

Code: Select all

 
 2601 mediatom  20   0  907m 145m 1380 S  9.0 57.9 528:18.70 mediatomb
 4579 root      20   0  2976 1164  916 R  3.6  0.5   0:00.08 top
    1 root      20   0  2440  168  148 S  0.0  0.1   0:03.16 init
    2 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 kthreadd
    3 root      15  -5     0    0    0 S  0.0  0.0   0:00.26 ksoftirqd/0
    4 root      15  -5     0    0    0 S  0.0  0.0   0:47.48 events/0
    5 root      15  -5     0    0    0 S  0.0  0.0   0:00.10 khelper
   52 root      15  -5     0    0    0 S  0.0  0.0   0:40.38 kblockd/0
   59 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 ata/0
   60 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 ata_aux
   68 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 khubd
  131 root      15  -5     0    0    0 D  0.0  0.0   6:21.52 kswapd0
  132 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 aio/0
  133 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 nfsiod
  723 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 scsi_eh_0
  726 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 scsi_eh_1
  729 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 scsi_eh_2

# Top while uploading

Code: Select all

 4595 nobody    20   0 12448 2608 1936 R 25.8  1.0   0:10.12 smbd
 2601 mediatom  20   0  907m 184m 1260 S 16.8 73.7 531:27.73 mediatomb
 4460 root      20   0     0    0    0 S  5.2  0.0   0:00.76 pdflush
 2301 mysql     20   0  127m 3448 1704 S  3.5  1.3 100:19.52 mysqld
 1979 root      15  -5     0    0    0 S  1.3  0.0   0:06.36 kjournald
 4630 root      20   0  2980 1252  996 R  1.3  0.5   0:00.14 top
   52 root      15  -5     0    0    0 S  0.6  0.0   0:42.16 kblockd/0
  131 root      15  -5     0    0    0 D  0.6  0.0   6:38.62 kswapd0
    1 root      20   0  2440  168  148 S  0.0  0.1   0:03.18 init
    2 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 kthreadd
    3 root      15  -5     0    0    0 S  0.0  0.0   0:00.26 ksoftirqd/0
    4 root      15  -5     0    0    0 S  0.0  0.0   0:48.42 events/0
    5 root      15  -5     0    0    0 S  0.0  0.0   0:00.10 khelper
   59 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 ata/0
   60 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 ata_aux
   68 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 khubd
  132 root      15  -5     0    0    0 S  0.0  0.0   0:00.00 aio/0
# Last lines in the Samba log file

Code: Select all

[2009/04/01 20:04:10, 0] lib/util_sock.c:write_data(562)
write_data: write failure in writing to client 192.168.0.181. Error Connection reset by peer
[2009/04/01 20:04:10, 0] lib/util_sock.c:send_smb(761)
Error writing 4 bytes to client. -1. (Connection reset by peer)
[2009/04/01 20:04:28, 1] smbd/service.c:make_connection_snum(950)
stor (192.168.0.181) connect to service storage initially as user nobody (uid=65534, gid=100) (pid 3013)
[2009/04/01 20:05:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:05:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:06:27, 1] smbd/service.c:make_connection_snum(950)
daniel (192.168.0.5) connect to service storage initially as user nobody (uid=65534, gid=100) (pid 3019)
[2009/04/01 20:06:43, 1] smbd/service.c:close_cnum(1150)
daniel (192.168.0.5) closed connection to service storage
[2009/04/01 20:15:44, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:15:44, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:17:29, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:17:29, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:18:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:18:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:28:15, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:28:15, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:31:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:31:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:32:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:32:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:41:13, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:41:13, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:44:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:44:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:47:29, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:47:29, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:54:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:54:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:57:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:57:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:01:54, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:01:54, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:01:54, 1] smbd/service.c:make_connection_snum(950)
daniel (192.168.0.5) connect to service storage initially as user nobody (uid=65534, gid=100) (pid 3198)
[2009/04/01 21:07:41, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:07:41, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:13:52, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:13:52, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:14:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:14:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:15:44, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:15:44, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:20:54, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:20:54, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:28:44, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:28:44, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:29:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 21:29:28, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
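
Most of the Samba log above is the repeated CUPS message, which appears to be printing-related and makes the connection entries hard to spot. A minimal sketch, with hypothetical function names, that pairs each header line with its message and drops the CUPS entries:

```python
# Sample entries copied from the Samba log above.
SAMBA_LOG = """\
[2009/04/01 20:04:10, 0] lib/util_sock.c:write_data(562)
write_data: write failure in writing to client 192.168.0.181. Error Connection reset by peer
[2009/04/01 20:05:17, 0] printing/print_cups.c:cups_cache_reload(85)
Unable to connect to CUPS server localhost - Connection refused
[2009/04/01 20:06:27, 1] smbd/service.c:make_connection_snum(950)
daniel (192.168.0.5) connect to service storage initially as user nobody (uid=65534, gid=100) (pid 3019)
"""


def samba_entries(text):
    """Pair each '[timestamp, level] source' header with the message line after it."""
    entries = []
    header = None
    for line in text.splitlines():
        if not line.strip():
            continue
        if line.startswith("["):
            header = line
        elif header is not None:
            entries.append((header, line))
            header = None
    return entries


def without_cups_noise(entries):
    """Drop entries whose message is the CUPS connection failure."""
    return [(h, m) for h, m in entries if "CUPS server" not in m]


if __name__ == "__main__":
    for header, message in without_cups_noise(samba_entries(SAMBA_LOG)):
        print(header)
        print("   ", message)
```

This only assumes the two-line layout seen in the log above (a bracketed header followed by one message line); entries that span more lines would need extra handling.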
# Apache2 log

Code: Select all

[Tue Mar 31 21:23:28 2009] [notice] FastCGI: process manager initialized (pid 2763)
[Tue Mar 31 21:23:31 2009] [warn] RSA server certificate is a CA certificate (BasicConstraints: CA == TRUE !?)
[Tue Mar 31 21:23:31 2009] [warn] RSA server certificate CommonName (CN) `localhost.localdomain' does NOT match server name!?
[Tue Mar 31 21:23:31 2009] [notice] Apache/2.2.3 (Debian) mod_fastcgi/2.4.2 PHP/5.2.6-5ex1 with Suhosin-Patch mod_ssl/2.2.3 OpenSSL/0.9.8c configured -- resuming normal operations
[Wed Apr 01 07:36:30 2009] [error] [client 221.136.64.122] client sent HTTP/1.1 request without hostname (see RFC2616 section 14.23): /w00tw00t.at.ISC.SANS.DFind:)
[Wed Apr 01 17:22:22 2009] [notice] caught SIGTERM, shutting down
[Wed Apr 01 17:24:53 2009] [warn] RSA server certificate is a CA certificate (BasicConstraints: CA == TRUE !?)
[Wed Apr 01 17:24:53 2009] [warn] RSA server certificate CommonName (CN) `localhost.localdomain' does NOT match server name!?
[Wed Apr 01 17:24:56 2009] [notice] FastCGI: process manager initialized (pid 2635)
[Wed Apr 01 17:25:02 2009] [warn] RSA server certificate is a CA certificate (BasicConstraints: CA == TRUE !?)
[Wed Apr 01 17:25:02 2009] [warn] RSA server certificate CommonName (CN) `localhost.localdomain' does NOT match server name!?
[Wed Apr 01 17:25:02 2009] [notice] Apache/2.2.3 (Debian) mod_fastcgi/2.4.2 PHP/5.2.6-5ex1 with Suhosin-Patch mod_ssl/2.2.3 OpenSSL/0.9.8c configured -- resuming normal operations
booskunk
Posts: 7
Joined: 13 Feb 2009, 18:33

DMA error

Post by booskunk »

I started to get this same DMA error today. I wonder if this is a symptom of overheating.
Binkem
Posts: 388
Joined: 10 Jul 2008, 02:26

Post by Binkem »

Try checking the drive temperature with hddtemp, then. This shouldn't be a problem, though, unless Bubba is in a confined space. My Bubba runs at just under 40 degrees, which is a safe temperature.
booskunk
Posts: 7
Joined: 13 Feb 2009, 18:33

Post by booskunk »

Do I need to compile hddtemp from source? I tried installing smartmontools right after getting my Bubba Two but couldn't get it to work.

Regarding my hang, local disk accesses don't seem to be a problem, but copying files to the Bubba Samba server over the network causes the hang. I checked the case temperature with a laser thermometer and it was 38 degrees Celsius.
johannes
Posts: 1470
Joined: 31 Dec 2006, 07:12
Location: Sweden

Post by johannes »

Gunnarsson and others;

Did you upgrade to the latest kernel (2.6.26.5-8), according to http://forum.excito.net/viewtopic.php?t=1685&start=30? This should solve this issue.

If the upgrade didn't work, please email support@excito.com and we'll figure out why. The new kernel fixes exactly your problem.
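
A minimal sketch for checking whether a kernel release string is at least 2.6.26.5-8. The helper names are hypothetical, and it assumes the release string has the same dotted/dashed numeric form as the versions quoted in this thread.

```python
import re

# Kernel version named in this thread as containing the fix.
FIXED = "2.6.26.5-8"


def kernel_key(release):
    """Turn '2.6.26.5-8' into (2, 6, 26, 5, 8) so versions compare numerically."""
    return tuple(int(n) for n in re.findall(r"\d+", release))


def has_fix(release):
    """True if the release is the fixed version or newer."""
    return kernel_key(release) >= kernel_key(FIXED)


if __name__ == "__main__":
    import platform

    # platform.release() returns the running kernel release, like `uname -r`.
    running = platform.release()
    print(running, "has fix" if has_fix(running) else "needs upgrade")
```

Comparing tuples of integers avoids the trap of plain string comparison, where "2.6.26.5-10" would sort before "2.6.26.5-8".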
/Johannes (Excito co-founder a long time ago, but now I'm just Johannes)
booskunk
Posts: 7
Joined: 13 Feb 2009, 18:33

Post by booskunk »

Installed hddtemp. /dev/sda temperature reaches 43. I did an rsync transfer but didn't see the problem. The hang happened when I did a samba transfer. I also had an ssh shell going.
booskunk
Posts: 7
Joined: 13 Feb 2009, 18:33

Post by booskunk »

I was at 2.6.26.5-2. Upgrading now. Thanks.
johannes
Posts: 1470
Joined: 31 Dec 2006, 07:12
Location: Sweden

Post by johannes »

Great.

(And FYI: the version I meant was 2.6.26.5-8, sorry for the auto-inserted cool smiley.)
/Johannes (Excito co-founder a long time ago, but now I'm just Johannes)
booskunk
Posts: 7
Joined: 13 Feb 2009, 18:33

Post by booskunk »

OK, 2.6.26.5-8 makes things better, but there is still a problem.
Summary of tests so far:

2.6.26.5-2 Vista samba copy to external esata: DMA error hang
2.6.26.5-2 Vista samba copy to internal drive: DMA error hang
2.6.26.5-2 Vista rsync copy to internal drive: success
2.6.26.5-8 Vista samba copy to external esata: hang
2.6.26.5-8 Vista samba copy to internal drive: success
2.6.26.5-8 XP samba copy to external esata: success

The DMA errors are gone, but I still get a hang when using a Vista client and samba to the external drive. I noticed that the "Bubba freeze copy files half solved ;)" thread also reported problems with Vista as the client. The 2.6.26.5-8 hang is not as hard as the 2.6.26.5-2 hang. Networking is still alive, because my ssh session will echo characters (after a long delay) but won't give a new prompt.
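
The test summary above can be written down as data to make the remaining failing combination easy to pick out. This is only a sketch; the tuple layout and the function name are made up for illustration.

```python
# (kernel, client, protocol, target, result), copied from the summary above.
TESTS = [
    ("2.6.26.5-2", "Vista", "samba", "external esata", "DMA error hang"),
    ("2.6.26.5-2", "Vista", "samba", "internal drive", "DMA error hang"),
    ("2.6.26.5-2", "Vista", "rsync", "internal drive", "success"),
    ("2.6.26.5-8", "Vista", "samba", "external esata", "hang"),
    ("2.6.26.5-8", "Vista", "samba", "internal drive", "success"),
    ("2.6.26.5-8", "XP", "samba", "external esata", "success"),
]


def hangs(tests, kernel):
    """Return (client, protocol, target) for every test on `kernel` that hung."""
    return [t[1:4] for t in tests if t[0] == kernel and t[4].endswith("hang")]


if __name__ == "__main__":
    for kernel in ("2.6.26.5-2", "2.6.26.5-8"):
        print(kernel, hangs(TESTS, kernel))
```

With the six tests listed, the only hang left on 2.6.26.5-8 is the Vista client copying over Samba to the external eSATA drive.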

Found a Samba nmbd crash, but it doesn't explain the ssh and web hang:

[2009/05/18 13:13:04, 0] lib/util.c:smb_panic(1599)
PANIC (pid 2703): internal error
[2009/05/18 13:13:04, 0] lib/util.c:log_stack_trace(1706)
BACKTRACE: 13 stack frames:
#0 /usr/sbin/nmbd(log_stack_trace+0x34) [0x100a6cd4]
#1 /usr/sbin/nmbd(smb_panic+0x54) [0x100a6e14]
#2 /usr/sbin/nmbd [0x1008eae4]
#3 [0xbfd59fb8]
#4 [0xbfd5a30c]
#5 /usr/sbin/nmbd(retransmit_or_expire_response_records+0x228) [0x1002b008]
#6 /usr/sbin/nmbd(unbecome_local_master_browser+0x164) [0x1001dae4]
#7 /usr/sbin/nmbd [0x1001dca4]
#8 /usr/sbin/nmbd [0x100282d0]
#9 /usr/sbin/nmbd(run_packet_queue+0x390) [0x1002c8c0]
#10 /usr/sbin/nmbd(main+0x704) [0x1001bc74]
#11 /lib/tls/libc.so.6 [0xfbae994]
#12 /lib/tls/libc.so.6(__libc_start_main+0xb0) [0xfbaead0]

The core file gives better symbols:

#0 0x0fbc5f6c in raise () from /lib/tls/libc.so.6
#1 0x0fbc7a0c in abort () from /lib/tls/libc.so.6
#2 0x1008e5d0 in dump_core ()
#3 0x100a6ec4 in smb_panic ()
#4 0x1008eae4 in dump_core_setup ()
#5 <signal handler called>
#6 0x100300b0 in remove_response_record ()
#7 0x1002b008 in retransmit_or_expire_response_records ()
#8 0x1001dae4 in unbecome_local_master_browser ()
#9 0x1001dca4 in unbecome_local_master_browser ()
#10 0x100282d0 in wins_refresh_name ()
#11 0x1002c8c0 in run_packet_queue ()
#12 0x1001bc74 in main ()