Topic: Fixing recursive error but reboot is needed?
Dustin Grush
Film Handler
Posts: 14
From: Johnstown, PA, USA
Registered: Apr 2018
posted 07-31-2019 06:41 PM
Thanks to all who replied above. I took Carsten's advice and ran the log analyzer when I got in this evening; the results are below. Everything booted uneventfully. I have yet to try to ingest another CRU drive. I substituted the theatre name with an * to protect the guilty. Any thoughts are greatly appreciated. I figured it would be a good idea to put this here to help the next person:

ingestc Log file Truncated by 'cleanlog.sh' in drmreport/doremi/log/ingestc.log :
Line 1:[Mon Jul 29 20:34:21 EDT 2019] *** Truncated by 'cleanlog.sh' ***
The system had to truncate the above log files. As a consequence some logged information has been erased.

Drive unhandled error in drmreport/doremi/log/kern.log :
Line 7766:Jul 31 01:02:12 2019 * kernel: sd 7:0:0:0: [sdh] Unhandled error code
Line 7777:Jul 31 01:02:12 2019 * kernel: sd 7:0:0:0: [sdh] Unhandled error code
An unhandled error occurred on a drive.

CRU Failure (frozen) in drmreport/doremi/log/kern.log :
Line 5475:Jul 28 19:50:41 2019 * kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4010000 action 0xe frozen
Line 5498:Jul 28 20:00:04 2019 * kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4040000 action 0xe frozen
Line 5546:Jul 28 21:42:15 2019 * kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4010000 action 0xe frozen
(My note: this one is odd. There was no CRU drive in the bay at this time.)
The drive in the CRU is not responding. This could mean a drive issue or a CRU reader issue. Note: Drive sde is the drive in the ingest CRU as well known as ata5 in the system. Note: On some units the CRU may be plugged to the ata6 motherboard socket in this case this error would mean a CD-ROM issue.
(My note: it took a bit longer to shut down last night. The screen said something about USB; I don't remember exactly what.)

Bad connection between the motherboard and an unknown hdd in drmreport/doremi/log/kern.log :
Line 7772:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 2
Line 7773:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 3
Line 7783:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 32
Line 7784:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 33
Line 7786:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 4
Line 7787:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 5
Line 7788:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 6
Line 7789:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 7
Line 7790:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 8
Line 7791:Jul 31 01:02:12 2019 * kernel: Buffer I/O error on device sdh1, logical block 9
An I/O errors have been detected in the Kern.log, this log is monitoring all hardware connected to the motherboard. An I/O error means one of the drives is faulty and this normally means the drive must be replaced.

Bad connection between the motherboard and an external USB hdd sdh in drmreport/doremi/log/kern.log :
Line 7771:Jul 31 01:02:12 2019 * kernel: end_request: I/O error, dev sdh, sector 2064
Line 7782:Jul 31 01:02:12 2019 * kernel: end_request: I/O error, dev sdh, sector 2304
An I/O errors have been detected in the Kern.log, this log is monitoring all hardware connected to the motherboard. An I/O error means one of the drives is faulty and this normally means the drive must be replaced. sdh is an external drive connected to the system using USB.

NTP no server suitable for synchronization found in drmreport/doremi/log/time.log :
Line 3714:[Mon Jul 29 20:34:37 2019][ERROR]: ntpdate: no server suitable for synchronization found
Line 3716:[Mon Jul 29 20:34:44 2019][ERROR]: ntpdate: no server suitable for synchronization found
Edit the configuration file /doremi/etc/ntpservers and specify a valid NTP server under the variable: NTPSERVERS="" Found a NTP server close to your location at www.pool.ntp.org, or if you have an TLMS, set the target at the IP address of the TLMS server. To make sure the server is available from the Doremi server ping it from a terminal window. It is highly recommended to use NTP, if the server drifts and RTC goes outside the allowed amount it is not covered under warranty

(My note on the next item: I have since deleted a few things to get storage below 85%.)
RAID Partion md0 has been overused in drmreport/doremi/log/sensors.log :
Line 10682:2019-07-24T20:36:23-04:00,1564014983,STOR0,active,96%,active,active,active
Line 10696:2019-07-25T20:49:17-04:00,1564102157,STOR0,active,96%,active,active,active
Line 10707:2019-07-26T20:06:48-04:00,1564186008,STOR0,active,96%,active,active,active
Line 10719:2019-07-27T20:56:40-04:00,1564275400,STOR0,active,87%,active,active,active
Line 10730:2019-07-28T19:48:49-04:00,1564357729,STOR0,active,87%,active,active,active
Line 10744:2019-07-29T20:34:45-04:00,1564446885,STOR0,active,89%,active,active,active
Line 10759:2019-07-30T20:38:18-04:00,1564533498,STOR0,active,89%,active,active,active
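
For the next person who wants to re-check two of these findings by hand, here is a minimal Python sketch that tallies the kernel I/O error lines per device and lists the sensors.log rows where storage usage is above a limit. It is only a sketch under assumptions: the function names are made up for illustration, the regular expressions only cover the exact message wording quoted above, the fifth comma-separated field of sensors.log is assumed to be the usage percentage, and the 85% default is taken from my note above rather than from any Doremi documentation.

```python
import re
from collections import Counter

# These patterns only cover the kern.log wording quoted in this post;
# other kernel versions may phrase the messages differently.
BUFFER_ERR = re.compile(r"Buffer I/O error on device (\w+), logical block (\d+)")
REQUEST_ERR = re.compile(r"end_request: I/O error, dev (\w+), sector (\d+)")


def count_io_errors(lines):
    """Count I/O error lines per device name (e.g. 'sdh' or 'sdh1')."""
    counts = Counter()
    for line in lines:
        for pattern in (BUFFER_ERR, REQUEST_ERR):
            match = pattern.search(line)
            if match:
                counts[match.group(1)] += 1
    return counts


def storage_over_threshold(lines, threshold=85):
    """Return (timestamp, percent) pairs for rows above the threshold.

    Assumption: the fifth comma-separated field is the usage percentage,
    as it appears to be in the sensors.log rows quoted in this post.
    """
    hits = []
    for line in lines:
        # Drop the analyzer's "Line NNNNN:" prefix if it is present.
        row = re.sub(r"^Line \d+:", "", line.strip())
        fields = row.split(",")
        if len(fields) < 5:
            continue
        value = fields[4]
        if not (value.endswith("%") and value[:-1].isdigit()):
            continue
        percent = int(value[:-1])
        if percent > threshold:
            hits.append((fields[0], percent))
    return hits
```

Both functions take any iterable of text lines, so they can be fed either the analyzer excerpts pasted above or an open log file. Counting by device name keeps sdh and sdh1 separate, which matches how the analyzer reported them as two findings.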
Thanks again.
Leo Enticknap
Film God
Posts: 7474
From: Loma Linda, CA
Registered: Jul 2000
posted 07-31-2019 07:52 PM
This is a wild guess, but it worked for me once in the past, with a DCP2K4 that was doing weird s*** similar to this (e.g. complaining that a CRU drive was causing it to lock up, when the CRU bay was empty and had been since the previous reboot), and for which no rational explanation could be found.
Replacing the BIOS settings battery on the motherboard fixed it.
I tried this, because my experience of PCs is that they can become unstable, throw random BSODs, etc., if they're running with a totally dead BIOS battery, and I happened to have spare BR2032s and CR2032s with me. As soon as I put a new one in, all the problems went away.
I now replace them as a matter of routine whenever I open up an old school server, unless I know that it has been replaced within the last year.
All times are Central (GMT -6:00)