{"id":218,"date":"2016-12-02T18:09:02","date_gmt":"2016-12-02T18:09:02","guid":{"rendered":"http:\/\/luliserver.com.ar\/?p=218"},"modified":"2023-12-19T23:42:33","modified_gmt":"2023-12-20T02:42:33","slug":"sata-hotswap-con-mdadm-raid","status":"publish","type":"post","link":"http:\/\/luliserver.com.ar\/?p=218","title":{"rendered":"SATA hotswap with mdadm RAID"},"content":{"rendered":"<p>I was happy to receive the new hotswap enclosure a while ago (more than a year), but I had never actually done a hotswap with it. Today I sinned by ignoring one of the maxims of IT: <code>If it works, do NOT touch it<\/code>.<br \/>\nSince I know that misfortune sits down right next to us just when everything is blowing up, I went ahead and tested it.<\/p>\n<p><span style=\"text-decoration: underline;\">Environment:<\/span><br \/>\nI have a file server where, unfortunately, a RAID1 array was left with a single disk. Since I use it as the Recycle Bin for the Samba accounts, I honestly do not care much about what is stored there, and if it breaks the damage would not be serious.<br \/>\nThe current state is:<\/p>\n<pre>root@server:~# cat \/proc\/mdstat\r\nPersonalities : [raid1]\r\nmd0 : active raid1 sdf[1]\r\n      976761424 blocks super 1.2 [2\/1] [_U]\r\n<\/pre>\n<p>These were the steps I followed:<\/p>\n<ol>\n<li>Unmount the filesystem:\n<pre>umount \/mnt\/md0\/<\/pre>\n<\/li>\n<li>Stop the RAID array:\n<pre>mdadm --manage \/dev\/md0 --stop<\/pre>\n<\/li>\n<li>Put the hard disk to sleep:\n<pre>hdparm -Y \/dev\/sdf<\/pre>\n<\/li>\n<li>Pull the disk out while the system is running.<\/li>\n<\/ol>\n<p>I read somewhere that we should also run:<\/p>\n<div class=\"comment-content\">\n<blockquote><p>echo 1 &gt; \/sys\/block\/sdb\/device\/delete<\/p>\n<p>Just in case the kernel has not realized, that the drive has been physically pulled out. 
(Source: <a href=\"http:\/\/serverfault.com\/questions\/5336\/how-do-i-make-linux-recognize-a-new-sata-dev-sda-drive-i-hot-swapped-in-without\" rel=\"nofollow\" data-slimstat-clicked=\"false\" data-slimstat-type=\"0\" data-slimstat-tracking=\"true\" data-slimstat-callback=\"true\">http:\/\/serverfault.com\/questions\/5336\/how-do-i-make-linux-recognize-a-new-sata-dev-sda-drive-i-hot-swapped-in-without<\/a>)<\/p><\/blockquote>\n<\/div>\n<p>Since I had not done this, I checked the logs:<\/p>\n<pre>Dec  2 12:51:45 server kernel: [793838.144423] ata6: exception Emask 0x10 SAct 0x0 SErr 0x4090000 action 0xe frozen\r\nDec  2 12:51:45 server kernel: [793838.144481] ata6: irq_stat 0x00400040, connection status changed\r\nDec  2 12:51:45 server kernel: [793838.144528] ata6: SError: { PHYRdyChg 10B8B DevExch }\r\nDec  2 12:51:45 server kernel: [793838.144572] ata6: hard resetting link\r\nDec  2 12:51:46 server kernel: [793838.863111] ata6: SATA link down (SStatus 0 SControl 300)\r\nDec  2 12:51:51 server kernel: [793843.852521] ata6: hard resetting link\r\nDec  2 12:51:51 server kernel: [793844.171849] ata6: SATA link down (SStatus 0 SControl 300)\r\nDec  2 12:51:51 server kernel: [793844.171860] ata6: limiting SATA link speed to 1.5 Gbps\r\nDec  2 12:51:56 server kernel: [793849.161272] ata6: hard resetting link\r\nDec  2 12:51:57 server kernel: [793849.480594] ata6: SATA link down (SStatus 0 SControl 310)\r\nDec  2 12:51:57 server kernel: [793849.480603] ata6.00: disabled\r\nDec  2 12:51:57 server kernel: [793849.480614] ata6: EH complete\r\nDec  2 12:51:57 server kernel: [793849.480623] ata6.00: detaching (SCSI 5:0:0:0)\r\nDec  2 12:51:57 server kernel: [793849.480940] sd 5:0:0:0: [sdf] Synchronizing SCSI cache\r\nDec  2 12:51:57 server kernel: [793849.480977] sd 5:0:0:0: [sdf]  Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK\r\nDec  2 12:51:57 server kernel: [793849.480981] sd 5:0:0:0: [sdf] Stopping disk\r\nDec  2 12:51:57 server kernel: [793849.480988] sd 5:0:0:0: [sdf] 
START_STOP FAILED\r\nDec  2 12:51:57 server kernel: [793849.480990] sd 5:0:0:0: [sdf]  Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK\r\n<\/pre>\n<p>Out of curiosity, I then tried the steps with the delete included:<\/p>\n<ol>\n<li>Unmount the filesystem:\n<pre>umount \/mnt\/md0\/<\/pre>\n<\/li>\n<li>Stop the RAID array:\n<pre>mdadm --manage \/dev\/md0 --stop<\/pre>\n<\/li>\n<li>Put the hard disk to sleep:\n<pre>hdparm -Y \/dev\/sdf<\/pre>\n<\/li>\n<li>Remove the disk from the kernel:\n<pre>echo 1 &gt; \/sys\/block\/sdf\/device\/delete<\/pre>\n<\/li>\n<li>Pull the disk out while the system is running.<\/li>\n<\/ol>\n<p>Logs:<\/p>\n<pre>Dec  2 14:36:15 srv-it kernel: [800094.929215] sd 5:0:0:0: [sdf] Synchronizing SCSI cache\r\nDec  2 14:36:15 srv-it kernel: [800094.929259] ata6.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6\r\nDec  2 14:36:15 srv-it kernel: [800094.929328] ata6.00: waking up from sleep\r\nDec  2 14:36:15 srv-it kernel: [800094.929367] ata6: hard resetting link\r\nDec  2 14:36:16 srv-it kernel: [800095.245764] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)\r\nDec  2 14:36:16 srv-it kernel: [800095.246223] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20110623\/psargs-359)\r\nDec  2 14:36:16 srv-it kernel: [800095.246232] ACPI Error: Method parse\/execution failed [\\_SB_.PCI0.SAT0.SPT5._GTF] (Node ffff8806060c0588), AE_NOT_FOUND (20110623\/psparse-536)\r\nDec  2 14:36:16 srv-it kernel: [800095.246854] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20110623\/psargs-359)\r\nDec  2 14:36:16 srv-it kernel: [800095.246860] ACPI Error: Method parse\/execution failed [\\_SB_.PCI0.SAT0.SPT5._GTF] (Node ffff8806060c0588), AE_NOT_FOUND (20110623\/psparse-536)\r\nDec  2 14:36:16 srv-it kernel: [800095.247041] ata6.00: configured for UDMA\/133\r\nDec  2 14:36:16 srv-it kernel: [800095.247045] ata6.00: retrying FLUSH 0xea Emask 0x0\r\nDec  2 14:36:16 srv-it kernel: [800095.247112] ata6: EH complete\r\nDec  2 14:36:16 srv-it kernel: 
[800095.247186] sd 5:0:0:0: [sdf] Stopping disk\r\nDec  2 14:36:16 srv-it kernel: [800095.247235] sdf: detected capacity change from 0 to 1000204886016\r\nDec  2 14:36:20 srv-it kernel: [800099.758977] ata6.00: disabled\r\nDec  2 14:36:38 srv-it kernel: [800117.854892] ata6: exception Emask 0x10 SAct 0x0 SErr 0x4090000 action 0xe frozen\r\nDec  2 14:36:38 srv-it kernel: [800117.854950] ata6: irq_stat 0x00400040, connection status changed\r\nDec  2 14:36:38 srv-it kernel: [800117.854997] ata6: SError: { PHYRdyChg 10B8B DevExch }\r\nDec  2 14:36:38 srv-it kernel: [800117.855043] ata6: hard resetting link\r\nDec  2 14:36:39 srv-it kernel: [800118.576295] ata6: SATA link down (SStatus 0 SControl 300)\r\nDec  2 14:36:39 srv-it kernel: [800118.576306] ata6: EH complete\r\n<\/pre>\n<p>Now to reconnect it: plug the disk back in and then run:<\/p>\n<ol>\n<li>Assemble the already-configured RAID array:\n<pre>mdadm -A \/dev\/md0<\/pre>\n<\/li>\n<li>Check that it came up:\n<pre>cat \/proc\/mdstat<\/pre>\n<\/li>\n<li>If it shows up as (auto-read-only), run:\n<pre>mdadm --readwrite \/dev\/md0<\/pre>\n<\/li>\n<li>Now we can mount the filesystem:\n<pre>mount -a<\/pre>\n<\/li>\n<\/ol>\n<p>In the end it looks something like this:<\/p>\n<pre>\r\nroot@server:~# mdadm -A \/dev\/md0\r\nmdadm: \/dev\/md0 has been started with 1 drive (out of 2).\r\nroot@server:~# cat \/proc\/mdstat\r\nPersonalities : [raid1]\r\nmd0 : active (auto-read-only) raid1 sdf[1]\r\n      976761424 blocks super 1.2 [2\/1] [_U]\r\nroot@server:~# mdadm --readwrite \/dev\/md0\r\nroot@server:~# cat \/proc\/mdstat\r\nPersonalities : [raid1]\r\nmd0 : active raid1 sdf[1]\r\n      976761424 blocks super 1.2 [2\/1] [_U]\r\nroot@server:~# mount -a\r\nroot@server:~# dfc\r\nFILESYSTEM               (=) USED      FREE (-) %USED AVAILABLE     TOTAL MOUNTED ON\r\n\/dev\/md0                 [==------------------]    5%    870.1G    916.9G \/mnt\/md0\r\nroot@server:~#\r\n<\/pre>\n<p>Connection 
log:<\/p>\n<pre>\r\nDec  2 14:50:21 server kernel: [800938.675339] ata6: exception Emask 0x10 SAct 0x0 SErr 0x4040000 action 0xe frozen\r\nDec  2 14:50:21 server kernel: [800938.675397] ata6: irq_stat 0x00000040, connection status changed\r\nDec  2 14:50:21 server kernel: [800938.675444] ata6: SError: { CommWake DevExch }\r\nDec  2 14:50:21 server kernel: [800938.675485] ata6: hard resetting link\r\nDec  2 14:50:27 server kernel: [800944.425243] ata6: link is slow to respond, please be patient (ready=0)\r\nDec  2 14:50:28 server kernel: [800945.934056] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)\r\nDec  2 14:50:28 server kernel: [800945.950971] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20110623\/psargs-359)\r\nDec  2 14:50:28 server kernel: [800945.950980] ACPI Error: Method parse\/execution failed [\\_SB_.PCI0.SAT0.SPT5._GTF] (Node ffff8806060c0588), AE_NOT_FOUND (20110623\/psparse-536)\r\nDec  2 14:50:28 server kernel: [800945.951226] ata6.00: ATA-8: ST1000DM003-9YN162, CC4B, max UDMA\/133\r\nDec  2 14:50:28 server kernel: [800945.951229] ata6.00: 1953525168 sectors, multi 0: LBA48 NCQ (depth 31\/32), AA\r\nDec  2 14:50:28 server kernel: [800945.951618] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20110623\/psargs-359)\r\nDec  2 14:50:28 server kernel: [800945.951624] ACPI Error: Method parse\/execution failed [\\_SB_.PCI0.SAT0.SPT5._GTF] (Node ffff8806060c0588), AE_NOT_FOUND (20110623\/psparse-536)\r\nDec  2 14:50:28 server kernel: [800945.951826] ata6.00: configured for UDMA\/133\r\nDec  2 14:50:28 server kernel: [800945.951832] ata6: EH complete\r\nDec  2 14:50:28 server kernel: [800945.951931] scsi 5:0:0:0: Direct-Access     ATA      ST1000DM003-9YN1 CC4B PQ: 0 ANSI: 5\r\nDec  2 14:50:28 server kernel: [800945.952113] sd 5:0:0:0: [sdf] 1953525168 512-byte logical blocks: (1.00 TB\/931 GiB)\r\nDec  2 14:50:28 server kernel: [800945.952116] sd 5:0:0:0: [sdf] 4096-byte physical blocks\r\nDec  2 14:50:28 server 
kernel: [800945.952158] sd 5:0:0:0: Attached scsi generic sg5 type 0\r\nDec  2 14:50:28 server kernel: [800945.952240] sd 5:0:0:0: [sdf] Write Protect is off\r\nDec  2 14:50:28 server kernel: [800945.952245] sd 5:0:0:0: [sdf] Mode Sense: 00 3a 00 00\r\nDec  2 14:50:28 server kernel: [800945.952286] sd 5:0:0:0: [sdf] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA\r\nDec  2 14:50:28 server kernel: [800945.968907]  sdf: unknown partition table\r\nDec  2 14:50:28 server kernel: [800945.969128] sd 5:0:0:0: [sdf] Attached SCSI disk\r\n<\/pre>\n<blockquote><p>Sources:<br \/>\n<a href=\"http:\/\/blog.kihltech.com\/2012\/12\/sata-hotswap-drive-in-mdadm-raid-array\/\" target=\"_blank\" rel=\"noopener\">http:\/\/blog.kihltech.com\/2012\/12\/sata-hotswap-drive-in-mdadm-raid-array\/<\/a><\/p><\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p>I was happy to receive the new hotswap enclosure a while ago (more than a year), but I had never actually done a hotswap with it. Today I sinned by ignoring one of the maxims of IT: If it works, do NOT touch it. 
Since I know that misfortune<\/p>\n","protected":false},"author":1,"featured_media":226,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false},"version":2}},"categories":[15,76,46,2],"tags":[24,49,117,116,114,113,115],"class_list":["post-218","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-bash","category-debian","category-os","category-scripts","tag-bash","tag-debian","tag-disconnect","tag-down","tag-hotswap","tag-mdadm","tag-sata"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"http:\/\/luliserver.com.ar\/wp-content\/uploads\/2016\/12\/IMG_8091.jpg","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p8Ixc0-3w","_links":{"self":[{"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=\/wp\/v2\/posts\/218"}],"collection":[{"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=218"}],"version-history":[{"count":8,"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=\/wp\/v2\/posts\/218\/revisions"}],"predecessor-version":[{"id":437,"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=\/wp\/v2\/posts\/218\/revisions\/437"}],"wp:featuredmed
ia":[{"embeddable":true,"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=\/wp\/v2\/media\/226"}],"wp:attachment":[{"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=218"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=218"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/luliserver.com.ar\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=218"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}