[Linux-Biella] Help VM zombie

Daniele Segato daniele.bilug at gmail.com
Mon 8 Aug 2011 12:39:22 CEST


On Tue, 2011-07-12 at 09:16 +0200, Daniele Segato wrote:
> Hi,
> 
> <premise>
> I'm not a sysadmin :)
> </premise>
> 
> I have a VirtualBox virtual machine that stops responding every now and
> then; the only fix is to power it off and restart it. The logs show
> absolutely nothing.
> 
> The host is Debian Stable, the VM is Debian Testing.
> 
> 
> The virtual machine runs in headless mode, without X, with N services.
> I always access it over ssh (or over http for the various services it
> provides).
> 
> Every now and then it stops responding, both over the web and over ssh.
> 
> I see nothing significant in the system logs.
> 
> My hypothesis is that one of the services maxes out CPU / RAM
> because of some bug...
> but I don't know how to confirm it...
> 
> What should I do? :)


ok...
I managed to see what happened.


Problems with the *virtual* disk of my VM...

I looked on the host machine (I don't know how I missed it before;
probably I simply didn't think of a problem on the host machine and
didn't look).

Your opinion:
is the disk dead?

> [399204.732141] ata1: EH in SWNCQ mode,QC:qc_active 0x1 sactive 0x1
> [399204.732192] ata1: SWNCQ:qc_active 0x1 defer_bits 0x0 last_issue_tag 0x0
> [399204.732193]   dhfis 0x1 dmafis 0x0 sdbfis 0x0
> [399204.732245] ata1: ATA_REG 0x50 ERR_REG 0x0
> [399204.732269] ata1: tag : dhfis dmafis sdbfis sacitve
> [399204.732294] ata1: tag 0x0: 1 0 0 1  
> [399204.732331] ata1.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x6 frozen
> [399204.732363] ata1.00: failed command: WRITE FPDMA QUEUED
> [399204.732393] ata1.00: cmd 61/00:00:40:f2:19/04:00:02:00:00/40 tag 0 ncq 524288 out
> [399204.732395]          res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [399204.732479] ata1.00: status: { DRDY }
> [399204.732508] ata1: hard resetting link
> [399204.732510] ata1: nv: skipping hardreset on occupied port
> [399205.200134] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
> [399205.240780] ata1.00: configured for UDMA/133
> [399205.240811] ata1.00: device reported invalid CHS sector 0
> [399205.240841] ata1: EH complete
> [399235.732076] ata1: EH in SWNCQ mode,QC:qc_active 0x1 sactive 0x1
> [399235.732117] ata1: SWNCQ:qc_active 0x1 defer_bits 0x0 last_issue_tag 0x0
> [399235.732118]   dhfis 0x1 dmafis 0x0 sdbfis 0x0
> [399235.732170] ata1: ATA_REG 0x50 ERR_REG 0x0
> [399235.732193] ata1: tag : dhfis dmafis sdbfis sacitve
> [399235.732222] ata1: tag 0x0: 1 0 0 1  
> [399235.732265] ata1.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x6 frozen
> [399235.732297] ata1.00: failed command: WRITE FPDMA QUEUED
> [399235.732332] ata1.00: cmd 61/00:00:40:f2:19/04:00:02:00:00/40 tag 0 ncq 524288 out
> [399235.732333]          res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [399235.732424] ata1.00: status: { DRDY }
> [399235.732454] ata1: hard resetting link
> [399235.732456] ata1: nv: skipping hardreset on occupied port
> [399236.200038] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
> [399236.224438] ata1.00: configured for UDMA/133
> [399236.224469] ata1.00: device reported invalid CHS sector 0
> [399236.224499] ata1: EH complete
> [399266.733803] ata1: EH in SWNCQ mode,QC:qc_active 0x1 sactive 0x1
> [399266.733839] ata1: SWNCQ:qc_active 0x1 defer_bits 0x0 last_issue_tag 0x0
> [399266.733840]   dhfis 0x1 dmafis 0x0 sdbfis 0x0
> [399266.733900] ata1: ATA_REG 0x50 ERR_REG 0x0
> [399266.733923] ata1: tag : dhfis dmafis sdbfis sacitve
> [399266.733952] ata1: tag 0x0: 1 0 0 1  
> [399266.733989] ata1.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x6 frozen
> [399266.734026] ata1.00: failed command: WRITE FPDMA QUEUED
> [399266.734056] ata1.00: cmd 61/00:00:40:f2:19/04:00:02:00:00/40 tag 0 ncq 524288 out
> [399266.734057]          res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [399266.734145] ata1.00: status: { DRDY }
> [399266.734174] ata1: hard resetting link
> [399266.734177] ata1: nv: skipping hardreset on occupied port
> [399267.200050] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
> [399267.226105] ata1.00: configured for UDMA/133
> [399267.226132] ata1.00: device reported invalid CHS sector 0
> [399267.226156] ata1: EH complete
> [399297.745562] ata1: EH in SWNCQ mode,QC:qc_active 0x1 sactive 0x1
> [399297.745598] ata1: SWNCQ:qc_active 0x1 defer_bits 0x0 last_issue_tag 0x0
> [399297.745599]   dhfis 0x1 dmafis 0x0 sdbfis 0x0
> [399297.745655] ata1: ATA_REG 0x50 ERR_REG 0x0
> [399297.745678] ata1: tag : dhfis dmafis sdbfis sacitve
> [399297.745704] ata1: tag 0x0: 1 0 0 1  
> [399297.745743] ata1.00: NCQ disabled due to excessive errors
> [399297.745748] ata1.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x6 frozen
> [399297.745797] ata1.00: failed command: WRITE FPDMA QUEUED
> [399297.745835] ata1.00: cmd 61/00:00:40:f2:19/04:00:02:00:00/40 tag 0 ncq 524288 out
> [399297.745836]          res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [399297.745927] ata1.00: status: { DRDY }
> [399297.745957] ata1: hard resetting link
> [399297.745959] ata1: nv: skipping hardreset on occupied port
> [399298.212119] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
> [399298.236355] ata1.00: configured for UDMA/133
> [399298.236383] ata1.00: device reported invalid CHS sector 0
> [399298.236415] ata1: EH complete
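
To make the pattern in the log above easier to see, here is a minimal
Python sketch that counts the "failed command" lines per device and
measures the time between them. It assumes the kernel messages are
available as plain text lines (for example saved from dmesg); the name
summarize_ata_failures and the regular expression are illustrative only
and not part of any existing tool. The sample lines are copied from the
log above.

```python
import re
from collections import Counter

# Matches kernel lines such as:
#   [399204.732363] ata1.00: failed command: WRITE FPDMA QUEUED
# search() is used so a leading "> " quote prefix does not matter.
FAILED_CMD = re.compile(
    r"\[\s*(\d+\.\d+)\]\s+(ata\d+(?:\.\d+)?): failed command: (.+?)\s*$"
)


def summarize_ata_failures(lines):
    """Return (Counter keyed by (device, command), list of timestamps)."""
    counts = Counter()
    stamps = []
    for line in lines:
        m = FAILED_CMD.search(line)
        if m:
            stamps.append(float(m.group(1)))
            counts[(m.group(2), m.group(3))] += 1
    return counts, stamps


# Sample input: the four "failed command" lines from the log above.
sample = [
    "> [399204.732363] ata1.00: failed command: WRITE FPDMA QUEUED",
    "> [399235.732297] ata1.00: failed command: WRITE FPDMA QUEUED",
    "> [399266.734026] ata1.00: failed command: WRITE FPDMA QUEUED",
    "> [399297.745797] ata1.00: failed command: WRITE FPDMA QUEUED",
]

counts, stamps = summarize_ata_failures(sample)

# Seconds elapsed between consecutive failures.
gaps = [later - earlier for earlier, later in zip(stamps, stamps[1:])]

for (device, command), n in counts.items():
    print(f"{device}: {command} failed {n} times")
print("seconds between failures:", [round(g, 1) for g in gaps])
```

On these four lines it reports four WRITE FPDMA QUEUED failures on
ata1.00, each about 31 seconds after the previous one. Together with the
identical "cmd" bytes in every entry, this suggests the same write keeps
timing out and being retried; by itself it does not say whether the
disk, the cable or the controller is at fault.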


