Start a Conversation

Unsolved

This post is more than 5 years old

1165

August 15th, 2012 09:00

Too many checkpioint created for NDMP backup

Hi,

We have configurd the NDMP backups and when the backup schedule starts, it will create too many snap sure checkpoints as shown in below example.

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) < Backup type: vbb >

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: DEBUG Value: n

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: DIRECT Value: y

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: EMC_OFFLINE_DATA Value: n

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: FILESYSTEM Value: /root_vdm_22/AFS_025

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: HIST Value: y

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: LEVEL Value: 1

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: SNAPSURE Value: y

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: TYPE Value: vbb

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: UPDATE Value: y

2012-08-15 15:49:11: NDMP: 4: Session 549 (thread ndmp549) Name: VBB Value: y

2012-08-15 15:49:11: NDMP: 6: 1: source_fsid:9518 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt593-9518-1345042151

2012-08-15 15:49:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097

2012-08-15 15:52:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464

2012-08-15 15:54:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097

2012-08-15 15:57:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464

2012-08-15 15:59:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097

2012-08-15 16:02:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464

2012-08-15 16:02:59: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_18/automaticNDMPCkpts/automaticTempNDMPCkpt587-120-1345036379

2012-08-15 16:04:42: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_28/automaticNDMPCkpts/automaticTempNDMPCkpt588-2778-1345036482

2012-08-15 16:04:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097

2012-08-15 16:07:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464

2012-08-15 16:09:57: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt589-9518-1345037097

2012-08-15 16:12:44: NDMP: 6: 2: fsid:-1 mount_point:/root_vdm_22/automaticNDMPCkpts/automaticTempNDMPCkpt590-7691-1345038464

2012-08-15 16:13:47: NDMP: 3: < LOG type: 2, msg_id: 0, entry: SnapSure file system creation fails, hasAssociatedMsg: 0, associatedMsgSeq: 0 >

We are using NAS700 runing on 5.6.52 code.

There are 4 stream avaialble to take the NDMP backup but when the backup started, only on one stream the throughput  will be seen and the other 3 jobs will be in waiting state. When the 1 job completes, teh backup of 2nd job will start.

Is this behaviour on the backup because of the  checkpoints? We are using Commvault backup

674 Posts

August 15th, 2012 23:00

The datamover is able to handle upt to 4 NDMP backups in parallel.

Please check your Commvault Software.

From the provided log, this looks like there are many automaticNDMPCkpts or only there mountpoints from different filesystems (id 9518, 7691, 120, 2778)). Please clean up (delete) all unused of these Checkpoints and mountpoints.

Also test if creating a checkpoint (fs_ckpt fsname -C) manualy is working. From your other posting (unable to aquire locks) this could be a reason ... for this problem

9 Posts

August 16th, 2012 02:00

Thanks Peter.

I hve checked the mount points and found below details. Does this mean that the FS is mounted on server_2 and lbsjsh_vdmb2

nasadmin@LBSTH-NAS1CS0 ~]$ nas_fs -i id=120

id        = 120

name      = BFS_009

acl       = 0

in_use    = True

type      = uxfs

worm      = off

volume    = v7207

pool      = clar_r5_performance

member_of = root_avm_fs_group_3

rw_servers=

ro_servers= server_2

rw_vdms   =

ro_vdms   = lbsjsh-vdmb2

auto_ext  = no,virtual_provision=no

deduplication   = Suspended

ckpts     = root_rep_ckpt_120_565809_1,root_rep_ckpt_120_565809_2

rep_sess  = 371_CK200062500340_000A_7207_CK200063700624_0010(ckpts: root_rep_ckpt_120_565809_1, root_rep_ckpt_120_565809_2)

stor_devs = CK200063700624-0141,CK200063700624-0136,CK200063700624-012D,CK200063700624-0122,CK200063700624-00E7,CK200063700624-00F0,CK200063700624-00D3,CK200063700624-00DC

disks     = d44,d39,d42,d37,d23,d11,d21,d9

disk=d44   stor_dev=CK200063700624-0141 addr=c16t2l9        server=server_2

disk=d44   stor_dev=CK200063700624-0141 addr=c32t2l9        server=server_2

disk=d44   stor_dev=CK200063700624-0141 addr=c0t2l9         server=server_2

disk=d44   stor_dev=CK200063700624-0141 addr=c48t2l9        server=server_2

disk=d39   stor_dev=CK200063700624-0136 addr=c0t2l6         server=server_2

disk=d39   stor_dev=CK200063700624-0136 addr=c48t2l6        server=server_2

disk=d39   stor_dev=CK200063700624-0136 addr=c16t2l6        server=server_2

disk=d39   stor_dev=CK200063700624-0136 addr=c32t2l6        server=server_2

disk=d42   stor_dev=CK200063700624-012D addr=c16t2l5        server=server_2

disk=d42   stor_dev=CK200063700624-012D addr=c32t2l5        server=server_2

disk=d42   stor_dev=CK200063700624-012D addr=c0t2l5         server=server_2

disk=d42   stor_dev=CK200063700624-012D addr=c48t2l5        server=server_2

disk=d37   stor_dev=CK200063700624-0122 addr=c0t2l2         server=server_2

disk=d37   stor_dev=CK200063700624-0122 addr=c48t2l2        server=server_2

disk=d37   stor_dev=CK200063700624-0122 addr=c16t2l2        server=server_2

disk=d37   stor_dev=CK200063700624-0122 addr=c32t2l2        server=server_2

disk=d23   stor_dev=CK200063700624-00E7 addr=c16t1l7

We are using the backup from the replicated FS and this output is of the replicated FS.

Manual checkpoint creation is sucessfull. We have issue only with snapsure checkpoint. On my other post also, I can see that the lock porcess are majorly of either creating a snapsure checkpoint or deletion of snapsure ckpt.

Also could you let know what parameter shouls we cehck from commvault?

Thanks,

Sachin

9 Posts

August 16th, 2012 02:00

When I grab for Automatic checkpoints, below is the out put

nasadmin@LBSJSH-NAS2CS0 db]$ server_mount ALL  | grep automaticTe  

automaticTempNDMPCkpt596-2778-1345045811 on /root_vdm_28/automaticNDMPCkpts/automaticTempNDMPCkpt596-2778-1345045811 ckpt,perm,ro

automaticTempNDMPCkpt568-7561-1345012584 on /root_vdm_26/automaticNDMPCkpts/automaticTempNDMPCkpt568-7561-1345012584 ckpt,perm,ro

automaticTempNDMPCkpt568-7561-1345012584 on /automaticNDMPCkpts/automaticTempNDMPCkpt568-7561-1345012584 ckpt,perm,ro

automaticTempNDMPCkpt596-2778-1345045811 on /automaticNDMPCkpts/automaticTempNDMPCkpt596-2778-1345045811 ckpt,perm,ro

Thanks,

Sachin

No Events found!

Top