Unsolved
52 Posts
July 1st, 2013 10:00
Celerra Can't Discover Tape
Hey guys, I have an SR open, but support hasn't been much help so far, so I figured I would post here to see if I can get any help.
I have a Data Mover on my Celerra that can't discover its tape drives. I know the zoning and paths are good because I see the Data Mover logged into the Data Domain VTL. I've also reconfigured the VTL connections a couple of times, so I know that's correct, and the HBA is logged into the Fibre Channel switch.
Commands I've run:
- server_devconfig server_3 -probe -scsi -nondisks
  -- comes up with no devices on the chain
[su118576@NASCTRL ~]$ .server_config server_3 -v "fcp portreset=2"
server_3 : commands processed: 1
command(s) succeeded
output is complete
1372695760: DRIVERS: 6: FCDMTL 2 [10.4.1] Scsi Port Bus Reset
1372695760: DRIVERS: 6: FCDMTL 2 [10.4.1] TPM Notify: st=0xa000000, flg=0x208, cmd=0x1
1372695760: DRIVERS: 6: FCDMTL 2 [10.4.1] Auto Neg Speed : In sync fmstat=0x200001f5
1372695760: DRIVERS: 6: FCDMTL 2 [10.4.1] Auto Neg Speed : Current Speed: 2Gbps
1372695762: ADMIN: 6: Command succeeded: fcp portreset=2
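A side note on reading these log lines: the leading number on each line looks like a Unix epoch timestamp in seconds, followed by what appear to be a facility name and a severity level. Assuming that reading is right, a short Python sketch like the one below converts the timestamp to UTC so the port reset can be lined up against switch-side logs. The function name `decode_log_line` and the field names are made up for this illustration.

```python
from datetime import datetime, timezone


def decode_log_line(line):
    """Split 'EPOCH: FACILITY: LEVEL: message' and convert EPOCH to UTC.

    Assumption: the first field is seconds since the Unix epoch.
    """
    # Split on the first three colons only, so colons inside the message survive.
    epoch, facility, level, message = [part.strip() for part in line.split(":", 3)]
    when = datetime.fromtimestamp(int(epoch), tz=timezone.utc)
    return when.strftime("%Y-%m-%d %H:%M:%S UTC"), facility, int(level), message


line = "1372695760: DRIVERS: 6: FCDMTL 2 [10.4.1] Scsi Port Bus Reset"
print(decode_log_line(line))
# -> ('2013-07-01 16:22:40 UTC', 'DRIVERS', 6, 'FCDMTL 2 [10.4.1] Scsi Port Bus Reset')
```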
[su118576@NASCTRL ~]$ .server_config server_3 -v "fcp show"
server_3 : commands processed: 1
command(s) succeeded
output is complete
FCP ONLINE HBA 0: ALPA 000001 WWN: 5006016839a03039 DX2 2G
FCP scsi-0: HBA 0: ALPA 0000ef SP-a01: 5006016139a02e50 Class 3
FCP ONLINE HBA 1: ALPA 000001 WWN: 5006016939a03039 DX2 2G
FCP scsi-16: HBA 1: ALPA 0000ef SP-b01: 5006016939a02e50 Class 3
FCP ONLINE HBA 2: S_ID 14bd80 WWN: 5006016a39a03039 DX2 2G
FCP scsi-32: HBA 2: CHAINS 32 - 47 OFFLINE
FCP scsi-64: HBA 2: CHAINS 64 - 79 OFFLINE
FCP scsi-80: HBA 2: CHAINS 80 - 95 OFFLINE
FCP scsi-96: HBA 2: CHAINS 96 - 111 OFFLINE
FCP scsi-112: HBA 2: CHAINS 112 - 127 OFFLINE
FCP scsi-128: HBA 2: CHAINS 128 - 143 OFFLINE
FCP scsi-144: HBA 2: CHAINS 144 - 159 OFFLINE
FCP scsi-160: HBA 2: CHAINS 160 - 175 OFFLINE
FCP scsi-176: HBA 2: CHAINS 176 - 191 OFFLINE
FCP scsi-192: HBA 2: CHAINS 192 - 207 OFFLINE
FCP scsi-208: HBA 2: CHAINS 208 - 223 OFFLINE
FCP scsi-224: HBA 2: CHAINS 224 - 239 OFFLINE
FCP scsi-240: HBA 2: CHAINS 240 - 255 OFFLINE
FCP scsi-256: HBA 2: CHAINS 256 - 271 OFFLINE
FCP scsi-272: HBA 2: CHAINS 272 - 287 OFFLINE
FCP scsi-288: HBA 2: CHAINS 288 - 303 OFFLINE
FCP scsi-304: HBA 2: CHAINS 304 - 319 OFFLINE
FCP scsi-320: HBA 2: CHAINS 320 - 335 OFFLINE
FCP scsi-336: HBA 2: CHAINS 336 - 351 OFFLINE
FCP scsi-352: HBA 2: CHAINS 352 - 367 OFFLINE
FCP scsi-368: HBA 2: CHAINS 368 - 383 OFFLINE
FCP scsi-384: HBA 2: CHAINS 384 - 399 OFFLINE
FCP scsi-400: HBA 2: CHAINS 400 - 415 OFFLINE
FCP scsi-416: HBA 2: CHAINS 416 - 431 OFFLINE
FCP scsi-432: HBA 2: CHAINS 432 - 447 OFFLINE
FCP scsi-448: HBA 2: CHAINS 448 - 463 OFFLINE
FCP scsi-464: HBA 2: CHAINS 464 - 479 OFFLINE
FCP scsi-480: HBA 2: CHAINS 480 - 495 OFFLINE
FCP scsi-496: HBA 2: CHAINS 496 - 511 OFFLINE
FCP scsi-512: HBA 2: CHAINS 512 - 527 OFFLINE
FCP scsi-528: HBA 2: CHAINS 528 - 543 OFFLINE
FCP scsi-544: HBA 2: CHAINS 544 - 559 OFFLINE
FCP scsi-560: HBA 2: CHAINS 560 - 575 OFFLINE
FCP scsi-576: HBA 2: CHAINS 576 - 591 OFFLINE
FCP scsi-592: HBA 2: CHAINS 592 - 607 OFFLINE
FCP scsi-608: HBA 2: CHAINS 608 - 623 OFFLINE
FCP scsi-624: HBA 2: CHAINS 624 - 639 OFFLINE
FCP scsi-640: HBA 2: CHAINS 640 - 655 OFFLINE
FCP scsi-656: HBA 2: CHAINS 656 - 671 OFFLINE
FCP scsi-672: HBA 2: CHAINS 672 - 687 OFFLINE
FCP OFFLINE HBA 3: ALPA 000001 WWN: 5006016b39a03039 DX2 SFP Not Present
FCP scsi-48: HBA 3: CHAINS 48 - 63 OFFLINE
1372696094: ADMIN: 6: Command succeeded: fcp show
To me, it almost looks like the FC or SCSI chain has been turned off or disabled.
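For anyone comparing several Data Movers, the `fcp show` listing above can be condensed with a small script. The Python sketch below is only an illustration: the regular expressions are inferred from the pasted output rather than from any documented format, and the names `summarize_fcp_show` and `hbas_without_targets` are hypothetical. It flags any HBA whose link is ONLINE but which has no target line, only "CHAINS ... OFFLINE" ranges, which is the situation HBA 2 is in here.

```python
import re

# Patterns are inferred from the output pasted above, not from a documented format.
HBA_RE = re.compile(r"^FCP\s+(ONLINE|OFFLINE)\s+HBA\s+(\d+):")
CHAIN_RE = re.compile(r"^FCP\s+scsi-\d+:\s+HBA\s+(\d+):\s+(.*)$")


def summarize_fcp_show(text):
    """Return {hba_number: {"state", "targets", "offline_ranges"}}."""
    summary = {}
    for raw in text.splitlines():
        line = raw.strip()
        hba = HBA_RE.match(line)
        if hba:
            summary[int(hba.group(2))] = {
                "state": hba.group(1), "targets": [], "offline_ranges": 0}
            continue
        chain = CHAIN_RE.match(line)
        if chain:
            entry = summary.setdefault(
                int(chain.group(1)),
                {"state": "UNKNOWN", "targets": [], "offline_ranges": 0})
            detail = chain.group(2)
            if detail.startswith("CHAINS") and detail.endswith("OFFLINE"):
                entry["offline_ranges"] += 1   # a whole chain range with nothing behind it
            else:
                entry["targets"].append(detail)  # anything else is treated as a bound target
    return summary


def hbas_without_targets(summary):
    """HBAs whose link is up but which show no target at all."""
    return sorted(h for h, info in summary.items()
                  if info["state"] == "ONLINE" and not info["targets"])


# Trimmed excerpt of the output from this post.
sample = """\
FCP ONLINE HBA 0: ALPA 000001 WWN: 5006016839a03039 DX2 2G
FCP scsi-0: HBA 0: ALPA 0000ef SP-a01: 5006016139a02e50 Class 3
FCP ONLINE HBA 2: S_ID 14bd80 WWN: 5006016a39a03039 DX2 2G
FCP scsi-32: HBA 2: CHAINS 32 - 47 OFFLINE
FCP scsi-64: HBA 2: CHAINS 64 - 79 OFFLINE
FCP OFFLINE HBA 3: ALPA 000001 WWN: 5006016b39a03039 DX2 SFP Not Present
FCP scsi-48: HBA 3: CHAINS 48 - 63 OFFLINE
"""

print(hbas_without_targets(summarize_fcp_show(sample)))  # -> [2]
```

HBA 3 is not flagged because its link is reported OFFLINE (no SFP), so the sketch singles out only the port that has fabric login but no visible targets.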
dynamox
9 Legend
•
20.4K Posts
0
July 1st, 2013 10:00
What tape library are you emulating? Did you also try another AUX port?
dynamox
July 1st, 2013 11:00
listed Online in "vtl initiator show" on DD. It does look odd that everything is listed as OFFLINE. So server_3 is your active Data Mover right now?
dernsber
July 1st, 2013 11:00
Yup... that was one of the first things I did... :-\
dynamox
July 1st, 2013 11:00
Did you try a shut / no shut on the port on the switch itself?
dernsber
July 1st, 2013 11:00
I have not tried another AUX port on the Data Mover.
I'm emulating an L180. The VTL works perfectly on the other Data Mover, and this worked fine in the past.
The only change is that I moved the Fibre Channel connection to a different switch. I'm worried that some form of reboot might be required for the Data Mover to register itself with the new switch so the two can talk happily.
I'm hoping there's a command that will simulate taking that port down and having it re-register without rebooting the Data Mover... similar to the FCP port reset command I've already run, which didn't work.
dynamox
July 1st, 2013 13:00
How hard would it be for you to swap the SFP/cable with server_2 (assuming that guy works) and try it in your aux0, if there is nothing in aux1 on server_3?
dernsber
July 1st, 2013 13:00
Yeah... I'm putting that off and dreading it... but you're right.
I'll try swapping the GBICs first, then I'll try changing the ports completely.
dernsber
July 1st, 2013 13:00
Data Mover 3 is 1 of 2 active DMs, correct.
Here's the output of the initiator show:
118576@ddfitz01# vtl initiator show
Initiator            Group                Status WWNN                    WWPN                    Port
-------------------- -------------------- ------ ----------------------- ----------------------- ----
nasctrl_server_2     nasctrl_server_2     Online 50:06:01:60:b9:a0:30:39 50:06:01:62:39:a0:30:39 1b
                                          Online 50:06:01:60:b9:a0:30:39 50:06:01:62:39:a0:30:39 5b
nasctrl_server_3     nasctrl_server_3     Online 50:06:01:60:b9:a0:30:39 50:06:01:6a:39:a0:30:39 1a
                                          Online 50:06:01:60:b9:a0:30:39 50:06:01:6a:39:a0:30:39 5a
nasctrl_server_4     nasctrl_server_4     n/a    n/a                     50:06:01:62:39:a0:30:ce none
tsmprd01_pci2_p1_c13 tsmprd01_pci2_p1_c13 Online 20:00:00:00:c9:86:c5:4b 10:00:00:00:c9:86:c5:4b 1b
                                          Online 20:00:00:00:c9:86:c5:4b 10:00:00:00:c9:86:c5:4b 5b
tsmprd01_pci5_p1_c7  tsmprd01_pci5_p1_c7  Online 20:00:00:00:c9:86:cf:d1 10:00:00:00:c9:86:cf:d1 1a
                                          Online 20:00:00:00:c9:86:cf:d1 10:00:00:00:c9:86:cf:d1 5a
tsmprd02_pci2_p1_c13 tsmprd02_pci2_p1_c13 Online 20:00:00:00:c9:86:cd:69 10:00:00:00:c9:86:cd:69 1b
                                          Online 20:00:00:00:c9:86:cd:69 10:00:00:00:c9:86:cd:69 5b
tsmprd02_pci5_p1_c7  tsmprd02_pci5_p1_c7  Online 20:00:00:00:c9:86:cc:59 10:00:00:00:c9:86:cc:59 1a
                                          Online 20:00:00:00:c9:86:cc:59 10:00:00:00:c9:86:cc:59 5a
tsmprd03_pci2_p1_c13 tsmprd03_pci2_p1_c13 Online 20:00:00:00:c9:86:d0:f1 10:00:00:00:c9:86:d0:f1 1b
                                          Online 20:00:00:00:c9:86:d0:f1 10:00:00:00:c9:86:d0:f1 5b
tsmprd03_pci5_p1_c7  tsmprd03_pci5_p1_c7  Online 20:00:00:00:c9:86:c6:4d 10:00:00:00:c9:86:c6:4d 1a
                                          Online 20:00:00:00:c9:86:c6:4d 10:00:00:00:c9:86:c6:4d 5a
-------------------- -------------------- ------ ----------------------- ----------------------- ----
Initiator            Symbolic Port Name                     Address Method
-------------------- -------------------------------------- --------------
nasctrl_server_2     EMC Celerra DM02:02 APM000730008060000 auto
nasctrl_server_3     EMC Celerra DM03:02 APM000730008060000 auto
nasctrl_server_4                                            auto
tsmprd01_pci2_p1_c13 Emulex LPe11002-S FV2.82a4 DV2.60k     auto
tsmprd01_pci5_p1_c7  Emulex LPe11002-S FV2.82a4 DV2.60k     auto
tsmprd02_pci2_p1_c13 Emulex LPe11002-S FV2.82a4 DV2.60k     auto
tsmprd02_pci5_p1_c7  Emulex LPe11002-S FV2.82a4 DV2.60k     auto
tsmprd03_pci2_p1_c13 Emulex LPe11002-S FV2.82a4 DV2.60k     auto
tsmprd03_pci5_p1_c7  Emulex LPe11002-S FV2.82a4 DV2.60k     auto
-------------------- -------------------------------------- --------------
So yes, everything looks perfect on the Data Domain side. I agree that this is odd... very odd.
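One quick sanity check on the two outputs in this thread: the Data Mover prints its HBA WWN as 16 bare hex digits in `fcp show`, while the Data Domain prints the WWPN colon-separated in `vtl initiator show`. A minimal Python sketch for comparing the two notations follows; the helper name `normalize_wwn` is hypothetical, and the values are copied from the posts above.

```python
def normalize_wwn(wwn):
    """Lowercase a WWN and drop ':' or '-' separators so both notations compare equal."""
    cleaned = wwn.replace(":", "").replace("-", "").strip().lower()
    if len(cleaned) != 16 or any(c not in "0123456789abcdef" for c in cleaned):
        raise ValueError(f"not a 64-bit WWN: {wwn!r}")
    return cleaned


dm_hba2 = "5006016a39a03039"              # HBA 2 WWN from 'fcp show' on server_3
dd_initiator = "50:06:01:6a:39:a0:30:39"  # nasctrl_server_3 WWPN from 'vtl initiator show'

print(normalize_wwn(dm_hba2) == normalize_wwn(dd_initiator))  # -> True
```

The match means the initiator the Data Domain reports as Online is the same port that shows every chain OFFLINE on server_3, which is consistent with the zoning and fabric login being fine and the problem sitting on the Data Mover side.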
Rainer_EMC
4 Operator
•
8.6K Posts
0
July 2nd, 2013 07:00
Try rebooting the Data Mover.
If that doesn't help, open a service request and let service dial in.
dynamox
July 2nd, 2013 07:00
Darn. Since support is not providing you with anything helpful, how about brute force: failover/failback?
dernsber
July 2nd, 2013 07:00
Yeah... that's where I'm at... it's just a major outage to do either (same 5-minute window, so I would just bounce it).
My other fear... what if the reboot doesn't fix it?
Thanks for the help guys!
Rainer_EMC
July 2nd, 2013 07:00
If you have recent code, then a failover / failback is a lot less than 5 minutes.
Typical these days is 1 minute for medium-sized systems, 30 seconds for small ones, and at most 2 minutes for large ones.
Failover / failback means two interruptions, but it's typically 20 seconds less than a warm reboot on medium/large configs.
Rainer
Rainer_EMC
July 2nd, 2013 07:00
No problem – just send a PO ☺
dernsber
July 2nd, 2013 07:00
Swapped the GBICs, no luck.
Swapped the auxiliary ports, no luck.
I feel like a daemon is not running or something got locked up on these puppies...
dynamox
July 2nd, 2013 07:00
At that point you can tell support you have exhausted all of the possible scenarios: "give me a new Data Mover."