Unsolved
This post is more than 5 years old
9 Posts
0
1165
August 15th, 2012 09:00
Too many checkpioint created for NDMP backup
Hi,
We have configurd the NDMP backups and when the backup schedule starts, it will create too many snap sure checkpoints as shown in below example.
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) < Backup type: vbb >
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: DEBUG Value: n
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: DIRECT Value: y
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: EMC_OFFLINE_DATA Value: n
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: FILESYSTEM Value: /root_vdm_22/AFS_025
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: HIST Value: y
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: LEVEL Value: 1
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: SNAPSURE Value: y
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: TYPE Value: vbb
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: UPDATE Value: y
2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: VBB Value: y
2012-08-15 15:49:11: NDMP: 6: 1: source_fsid:9518 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt593-9518-1345042151
2012-08-15 15:49:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097
2012-08-15 15:52:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464
2012-08-15 15:54:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097
2012-08-15 15:57:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464
2012-08-15 15:59:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097
2012-08-15 16:02:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464
2012-08-15 16:02:59: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_18/automaticNDMPCkpts/automaticTempNDMPCkpt587-120-1345036379
2012-08-15 16:04:42: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_28/automaticNDMPCkpts/automaticTempNDMPCkpt588-2778-1345036482
2012-08-15 16:04:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097
2012-08-15 16:07:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464
2012-08-15 16:09:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097
2012-08-15 16:12:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464
2012-08-15 16:13:47: NDMP: 3: < LOG type: 2, msg_id: 0, entry: SnapSure file system creation fails, hasAssociatedMsg: 0, associatedMsgSeq: 0 >
We are using NAS700 runing on 5.6.52 code.
There are 4 stream avaialble to take the NDMP backup but when the backup started, only on one stream the throughput will be seen and the other 3 jobs will be in waiting state. When the 1 job completes, teh backup of 2nd job will start.
Is this behaviour on the backup because of the checkpoints? We are using Commvault backup
Peter_EMC
674 Posts
0
August 15th, 2012 23:00
The datamover is able to handle upt to 4 NDMP backups in parallel.
Please check your Commvault Software.
From the provided log, this looks like there are many automaticNDMPCkpts or only there mountpoints from different filesystems (id 9518, 7691, 120, 2778)). Please clean up (delete) all unused of these Checkpoints and mountpoints.
Also test if creating a checkpoint (fs_ckpt fsname -C) manualy is working. From your other posting (unable to aquire locks) this could be a reason ... for this problem
omcsan
9 Posts
0
August 16th, 2012 02:00
Thanks Peter.
I hve checked the mount points and found below details. Does this mean that the FS is mounted on server_2 and lbsjsh_vdmb2
nasadmin@LBSTH-NAS1CS0 ~]$ nas_fs -i id=120
id = 120
name = BFS_009
acl = 0
in_use = True
type = uxfs
worm = off
volume = v7207
pool = clar_r5_performance
member_of = root_avm_fs_group_3
rw_servers=
ro_servers= server_2
rw_vdms =
ro_vdms = lbsjsh-vdmb2
auto_ext = no,virtual_provision=no
deduplication = Suspended
ckpts = root_rep_ckpt_120_565809_1,root_rep_ckpt_120_565809_2
rep_sess = 371_CK200062500340_000A_7207_CK200063700624_0010(ckpts: root_rep_ckpt_120_565809_1, root_rep_ckpt_120_565809_2)
stor_devs = CK200063700624-0141,CK200063700624-0136,CK200063700624-012D,CK200063700624-0122,CK200063700624-00E7,CK200063700624-00F0,CK200063700624-00D3,CK200063700624-00DC
disks = d44,d39,d42,d37,d23,d11,d21,d9
disk=d44 stor_dev=CK200063700624-0141 addr=c16t2l9 server=server_2
disk=d44 stor_dev=CK200063700624-0141 addr=c32t2l9 server=server_2
disk=d44 stor_dev=CK200063700624-0141 addr=c0t2l9 server=server_2
disk=d44 stor_dev=CK200063700624-0141 addr=c48t2l9 server=server_2
disk=d39 stor_dev=CK200063700624-0136 addr=c0t2l6 server=server_2
disk=d39 stor_dev=CK200063700624-0136 addr=c48t2l6 server=server_2
disk=d39 stor_dev=CK200063700624-0136 addr=c16t2l6 server=server_2
disk=d39 stor_dev=CK200063700624-0136 addr=c32t2l6 server=server_2
disk=d42 stor_dev=CK200063700624-012D addr=c16t2l5 server=server_2
disk=d42 stor_dev=CK200063700624-012D addr=c32t2l5 server=server_2
disk=d42 stor_dev=CK200063700624-012D addr=c0t2l5 server=server_2
disk=d42 stor_dev=CK200063700624-012D addr=c48t2l5 server=server_2
disk=d37 stor_dev=CK200063700624-0122 addr=c0t2l2 server=server_2
disk=d37 stor_dev=CK200063700624-0122 addr=c48t2l2 server=server_2
disk=d37 stor_dev=CK200063700624-0122 addr=c16t2l2 server=server_2
disk=d37 stor_dev=CK200063700624-0122 addr=c32t2l2 server=server_2
disk=d23 stor_dev=CK200063700624-00E7 addr=c16t1l7
We are using the backup from the replicated FS and this output is of the replicated FS.
Manual checkpoint creation is sucessfull. We have issue only with snapsure checkpoint. On my other post also, I can see that the lock porcess are majorly of either creating a snapsure checkpoint or deletion of snapsure ckpt.
Also could you let know what parameter shouls we cehck from commvault?
Thanks,
Sachin
omcsan
9 Posts
0
August 16th, 2012 02:00
When I grab for Automatic checkpoints, below is the out put
nasadmin@LBSJSH-NAS2CS0 db]$ server_mount ALL | grep automaticTe
automaticTempNDMPCkpt596-2778-1345045811 on /root_vdm_28/automaticNDMPCkpts/automaticTempNDMPCkpt596-2778-1345045811 ckpt,perm,ro
automaticTempNDMPCkpt568-7561-1345012584 on /root_vdm_26/automaticNDMPCkpts/automaticTempNDMPCkpt568-7561-1345012584 ckpt,perm,ro
automaticTempNDMPCkpt568-7561-1345012584 on /automaticNDMPCkpts/automaticTempNDMPCkpt568-7561-1345012584 ckpt,perm,ro
automaticTempNDMPCkpt596-2778-1345045811 on /automaticNDMPCkpts/automaticTempNDMPCkpt596-2778-1345045811 ckpt,perm,ro
Thanks,
Sachin