NetApp Helpful Commands
Filer General
Console messages are configured in
/etc/syslog.conf.sample.
By default there is no such file, but if the user copies and modifies this sample, they will have
/etc/syslog.conf, which tells syslog where to direct
console messages (typically
/etc/messages)
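A minimal /etc/syslog.conf sketch (standard facility.level syntax; the exact selectors here are an assumption, adjust as needed):
*.info /etc/messages    # log everything of level info and above to /etc/messages
*.err /dev/console      # also echo errors to the console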
sysconfig -t    (tape information)
source -v /etc/rc - reads and executes a file containing
filer commands line by line
AutoSupport (user-triggered support)
options autosupport.doit <string, e.g. TEST>
Telnetting to the Filer
Only one user can telnet at a time
options telnet    (lists the telnet-related options)
Autosupport Configuration
Filer> options autosupport
autosupport.mailhost <mail host>
autosupport.support.to <email address>
autosupport.doit <string>
autosupport.support.transport https or smtp
autosupport.support.url <url; the address must be reachable>
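A concrete configuration sketch (the host name and values are placeholders, not from the original notes):
filer> options autosupport.mailhost smtp.example.com
filer> options autosupport.support.transport smtp
filer> options autosupport.doit TEST    # sends a user-triggered autosupport with subject TEST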
Autosupport troubleshooting
1. ping netapp.com from filer
2. TCP 443 (SSL) should be open at the SMTP server;
the SMTP server may sit on the DMZ side
3. A mail relay must be specified in Exchange: the filer's host name or IP address must be listed in the mail relay, and routing for netapp.com (or routing by this host or IP) must be enabled for the filer. The filer acts as an SMTP client, and in a typical mail setup no SMTP client can send mail through the mail server to another SMTP server when the host's identity differs from the mail ID; relaying is generally blocked.
4. If a proxy server is in the path, it must pass the HTTP/HTTPS URL
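One way to verify relaying by hand (a hedged sketch; host names and addresses are placeholders): telnet from an admin host to the SMTP server and walk the SMTP dialog:
# telnet smtp.example.com 25
HELO filer.example.com
MAIL FROM:<filer@example.com>
RCPT TO:<autosupport@netapp.com>
A "550 relaying denied" response here means the relay must be opened for the filer's host/IP.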
RAID Scrub (weekly)
a. options raid.scrub.duration 360
b. options raid.scrub.schedule sun@01
a. limits the scrub to only 6 hrs (360 minutes)
b. forces the scrub to start on Sunday at 1 am
RAID group
vol add vol0 -g rg0 2    (add 2 disks to RAID group rg0 of vol0)
vol options yervol raidsize 16    (changes the raidsize setting
of the vol yervol to 16)
vol create newvol -t raid_dp -r 16 32@36
- newvol creation with RAID-DP protection.
RAID group size is 16 disks. Since the vol
consists of 32 disks, they will form 2 RAID
groups, rg0 & rg1
Max RAID group size
RAID-DP 28
RAID4 14
vol options for snapshots
nosnapdir off    (default: off)
nosnap off    (default: off)
Disk Fail/unfail
priv set advanced
disk fail <disk name>
disk unfail <disk name>
When a disk goes bad it partially fails, then a prefail copy
is seen when sysconfig -r or sysconfig -d is run. Sometimes it may
just hang there, so disk fail -i <disk name> will
release the disk and reconstruct the RAID group.
Disk troubleshooting
priv set advanced
led_on <drive id, e.g. 1d.16>
led_off <drive id>
blink_on 4.19    (the failed disk will now show orange)
blink_off 4.19
Spare disks in a vol
vol status -s
FAS270 (this must be done, otherwise spares are not seen)
priv set advanced
disk show -v    (to see who owns the disk. If it has come from
another filer, the disk block header needs to be
removed. For that:)
disk unfail <disk id>
disk assign 0b.23
fcadmin device_map
If a drive is not shown in FilerView
Filer> storage show disk -p
Zeroing disks
priv set advanced
disk zero spares    (zeroes out the data on all spare disks)
sysconfig -r    (will show the % zeroed for the spare disks)
R100 & R150 Disk Swap
1. find the bad disk and identify it
2. type disk swap <disk id>
3. remove the disk
4. wait 20 sec
5. type disk swap again
6. insert the new disk
7. wait 20 sec for the rescan
Out of inodes
1. Check the % of inodes used with
Filer> df -i
2. To increase
Filer> maxfiles <vol name> <max>
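For example (the volume name and count are placeholders):
Filer> maxfiles vol1 2000000
Note that raising maxfiles consumes space for inodes and generally cannot be lowered afterwards.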
NVRAM
Battery check
Filer> priv set diag
Filer> nv
=> should show battery status as OK and voltage as, e.g.,
NVRAM3 6V
raid.timeout in the raid options controls (default 24 hr) the trigger when the battery is low
In 940s, NVRAM5 is used as the cluster interconnect card as well, "two in one", on slot 11
Time Daemon
(ports 123, 13, 37 must be open)
When there is a large skew, lots of messages appear like
CfTimeDaemon: displacements/skews: 10/3670, 10/3670, 11/3670
Because of this, hourly snapshot creation also fails or an "in progress" message appears.
Because timed.max_skew is set to 30 min, we may see the above message every 30 min - 1 hr.
If we set this to 5s and watch how the skew develops (once timed.log is turned ON): if we see lots of skew messages, motherboard replacement may be required.
As a temporary measure, set
cf.timed.enable ON on both cluster filers and watch for those errors
Checking from a unix host
# ntptrace -v filername
From the filer, check
Filer> options timed
(check all the options of this)
From FilerView => Set Date and Time: Synchronize Now <ip of NTP server> => do Synchronize Now and check NTP from the unix host.
Tip: if there are multiple interfaces on the filer, make sure they are properly listed in the NIS or DNS server; the same host name may need to list multiple IP addresses.
BPS (Bytes Per Sector) of Disk
Block Append Checksum requires each disk drive to be formatted to 520 rather than 512 bytes per sector. This provides a total of 4160 bytes in 8 sectors. This space is broken into two parts: the first part is 4096 bytes (4K, the WAFL block size) of file system data, and the remaining 64 bytes contain the checksum data for the previous 4096 bytes. In this manner, a checksum block is appended to each block of data.
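The arithmetic, written out:
8 sectors x 520 bytes/sector = 4160 bytes
4160 bytes = 4096 bytes (one 4K WAFL block) + 64 bytes (checksum)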
Environmental Status
The top line in each channel says failures: yes, if there are any.
Subsequent messages should report
Power
Cooling
Temperature
Embedded switching
[all: none]
(if there is no problem)
Volume
vol options vol0
vol status vol0 -r    (raid info of the volume)
sysconfig -r
vol options vol0 raidsize 9
vol add vol0 <number of disks>
vol status -l    (to display all volumes)
Aggr Volume creation
Filer> aggr create aggr1 10
Filer> vol create log1 aggr1 20g
When a vol has gone bad
vol wafliron start <vol name> -f
To list broken disks in a volume
vol status -f
sysconfig -r will also list the failed disks
Double Parity
vol create -t raid_dp -r 2    (minimum of two)
(there are two parity disks per RAID group, holding parity and double-parity data)
Environment status - e.g. temp/shelf issues
environment chassis list_sensors
environment dump
RSH options - rsh access to the filer
options rsh.enable on
An admin host needs to be added to do RSH (can be done from FilerView) - not root. The RSH security settings must be set with either an IP or a hostname, but with a matching username for the logon accounts (not root, but the domain admin account)
RSH access from a unix host
# rsh -l root:<console p/w> <ip of filer> "<command>"
(add this unix host to the filer's /etc/hosts.equiv file - similar for a windows host as well)
(this command can be cron'ed on unix to make it scheduled; see the crontab sketch after this section)
RSH Port 514 / TCP
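A minimal crontab sketch for scheduling such a command from the unix host (the filer name, schedule, and log path are placeholders):
# run sysconfig -r against the filer every night at 23:00 and append the output to a log
0 23 * * * /usr/bin/rsh -l root filer1 "sysconfig -r" >> /var/log/filer1_sysconfig.log 2>&1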
Registry Walk
Filer> registry walk status.vol.<vol name>
NFS
Scheduling any job at filer
From a windows host (the admin host), enable rsh (windows 2003 box)
C:\> rsh sim -l root -n sysconfig -r    (gives the output; "sim" is the filer)
Filer http access
1. license http
2. options httpd.enable on
3. options httpd.rootdir xxxx    (a location like /vol/vol0/<share path or
qtree>)
Volume performance optimization
vol options volname minra on
(minimal read-ahead)
P/W
To change the admin host administrator's p/w
Filer> passwd
Filer> login: administrator
Filer> new password: .....
To change root p/w
1. attach to the console - a straight console
2. press Ctrl-C while booting
3. on the menu choose option 3 - password change - root
Ctrl-C - boot menu options
1. password reset - root
2. disk initialize and destroy, and set up the new filer
New filer setup
software get <url> -f <filename>
software install <url>
Environment
Environment status all
Previous ONTAP on flash
priv set diag
version -b    (will show the contents of the flash)
Firmware upgrade of disks
priv set advanced
Filer*> disk_fw_update
Quotas
Lines in /etc/quotas
/vol/vol0/testftp tree 10m
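A slightly fuller /etc/quotas sketch (names and limits are placeholders; the columns are target, type, disk limit, file limit):
/vol/vol0/testftp  tree            10m   -
jsmith             user@/vol/home  500m  10k
*                  group@/vol/home 2g    -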
WAFL stuff
vol wafliron - checks the vol at the WAFL level
WAFL_check (used when inconsistencies happen, e.g. when a vol
becomes restricted all of a sudden)
To correct an inconsistent volume
1. Ctrl-C while booting
2. at the Selection? prompt enter WAFL_check -z
For slow access, backup, or performance issues
Filer> wafl scan measure_layout vol0
Filer> wafl scan measure_layout /vol/vol0/filename
Filer> wafl scan status [vol|file]    (to view)
NFS General
/etc/exports
/vol/test -rw,root=sun1
/vol/vol1 -rw,root=sun1
#mkdir /mnt/filer
#mount filer1:/vol/vol1 /mnt/filer
/etc/rmtab - maintains the mount points
/etc/hosts - name and IP address
/etc/nsswitch.conf - resolution order file
Filer> exportfs
Filer> rdfile /etc/exports
filer> exportfs -a
filer> exportfs -io rw=<ip address>,root=<ip address> <path>
NFS troubleshooting
wcc -u <unix user>    (unix credential)
>exportfs -c host pathname ro|rw|root    # checks the access cache
for host permission
>exportfs -s pathname    # verifies the path to which a vol is
exported
>exportfs -f    # flushes the access cache entries and reloads
>exportfs -r    # ensures only persistent exports are loaded
NFS error 70 - stale file handle
>vol read_fsid
# mount    (on the unix host, displays which protocol is being used for the mount)
# mount -o tcp < >
Qtree security
portmap -d
rpcinfo -p <filer ip>
Lock Manager Release
priv set advanced
sm_mon -u <NFS_client_hostname>
While changing the mode
# chmod 4710 oidldapd
chmod: changing permissions of `oidldapd': Input/output error
In /var/log/messages the following errors appear:
Mar 30 19:44:59 bilbo kernel: nfs_refresh_inode: inode number mismatch
Mar 30 19:44:59 bilbo kernel: expected (0x950485c3/0x9b7609), got (0x950485c3/0x7d0b11)
Told the customer to get rid of nosuid in the exports file, and that solved the issue.
Permission Denied: file handle
67000000 6ad77710 20000000 107754a 99f750f 84ce0064 67000000 6ad77700
First two numbers: FSID
Next three: FID, inode, FID
Next three: FID of the export point
Now, the inode is different for each volume.
The FSID is found by
priv set diag
vol read_fsid vol0
=> gives a hex number that should match one of the numbers above, indicating which volume's file has the problem. The hex number can be converted to a decimal value as well.
On the unix side
# find <mount point> -inum <decimal value>
# find /mnt/clearcase -inum _________
(checking the FID for the above mount point)
# cat /etc/mnttab
(look here to find that number as well)
# ls -li    prints inode numbers in decimal - convert that to
hex
# find . -inum <number> -print
(Sometimes the vol FSID number found must be byte-reversed to get the exact place of the inode)
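A quick way to convert between the hex values in the file handle and the decimal inode numbers that find expects (POSIX shell printf; the example value is taken from the handle above):
# printf "%d\n" 0x9b7609
10188297
# printf "0x%x\n" 10188297
0x9b7609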
General Permission Problems
Check the export permissions
Check the local unix system - file-level and owner-level
permissions, and also the qtree security
(Sometimes the filer permission comes to sit on top of the local permission on the unix box, so it cannot be seen - the files become hidden)
To fix, use
# chmod
# chown
Read unix files
# cat
# more
# vi
NFS Performance
pktt start e5a, pktt dump e5a, pktt stop    (all three - start to end)
sysstat
nfsstat -d    (displays cache statistics)
-z    (zeroes out the stats)
-m    (mount point statistics)
perfstat -b -f filename > perfstat.begin
perfstat -e -f filename > perfstat.end
# time mkfile 10m test    (time it takes)
# time cp test
Windows host> sio_ntap_sol 100 100 4096 100m 10 2 a.file
b.file -noflock
CPU utilization 100 percent
Customer needs to collect and send
perfstat -f <file name> -t 5 > perfstat.out
More detailed perfstat
perfstat -t 2 -f nasx > text.txt
perfstat -t 2 -f nasx -P flat > text.txt
-P domains (SMP):
~ flat
~ kahuna
~ network
~ raid
~ storage
Other NFS options
options wafl.root_only_chown on
options cifs.nfs_root_ignore_acl on
Common NFS error messages
nfs mount: /remote_file_system_name: Stale NFS file handle
This error message means that an opened file or directory
has been destroyed or recreated.
NFS error 70
A file or directory that was opened by the NFS client was either removed or replaced on the NFS filer server.
Locked file findings in NFS
Filer> priv set advanced
Filer> lock_dump -h | -f    (h or f)
21048 0x00000687 : 0x00088720 0 : 0 1/3 :3 LOCK_ (0xfffffc000598, ..........)
a. 21048 is the pid of the process; check in solaris that it is running
b. take the value 0x00000687 and convert it to decimal to obtain the inode number
(in solaris, $ echo 0x000006d7=D|adb will convert)
c. to find the file
solaris $ find . -inum 1751 -print
Networking Troubleshooting
Filer> traceroute
Filer> ping
Filer> ifconfig    (for IP address related issues)
Filer> routed status
Filer> routed off
Filer> routed on
DHCP
The filer cannot have a dynamic DHCP address; the address is stored in the /etc/rc file as static even if DHCP is chosen.
Packet
netstat -i
netstat -I <interface name, like ns0, e5a, etc.>
netdiag -vV
ifstat -a    (flow control information at the bottom)
10/100/1G flow control etc. is purely switch-based: whatever
the switch is set to, the filer takes
Routing table of filer
netstat -rn
route -f    (to flush)
Port
netstat -a    (to check all open ports on the filer)
netstat    (to see all established connections)
Port numbers
514/tcp    rsh
135 tcp/udp    rpc
111/udp    rpc (sunrpc)
Network troubleshooting
Cannot ping to another subnet
1. netstat -rn should have the default route address at the top
2. do routed status if there is no entry
3. even if the rc file shows the default gateway address - add it
manually:
route add default <ip address> 1, and check the above
Checking steps
> rdfile /etc/rc
> ifconfig -a
> netstat -rn    (the gateway line must be there)
> routed status
> routed on    (if the gateway is not there, add it manually)
Packet Tracing on filer
1. pktt start e0 -b 1m -i 192.168.136.130
2. pktt status e0    (should show some traces)
3. pktt dump e0 -f /mytrace.trc
4. pktt stop all
5. the file is created at the C$ of the filer
6. make a cifs connection to the filer and point to \\<filer>\C$
7. get the mytrace.trc file
8. open it with Ethereal or Packetyzer
Brocade Switch
# switchshow
# wwn
10:00:00:05:1e:34:b0:fc    (example output)
# ssn "10:00:00:05:1e:34:b0:fc"    (sets the switch serial
number to the wwn)
MCData Switch
If a direct connection works but not through the McData switch, verify that OSMS is licensed and enabled.
> config features show
> config features opensysMS 1
> storage show switch
switchshow
cfgshow
portdisable
portenable
switchdisable
portperfshow
CIFS
CIFS setup
Cifs setup
Cifs configuration files
/etc/cifsconfig_setup.cfg
/etc/usermap.cfg
/etc/passwd
/etc/cifsconfig_share.cfg
Cifs general
cifs shares
cifs access
cifs restart
cifs shares eng
cifs shares -add eng /vol/cifsvol/eng
cifs access eng Full Control
cifs sessions
cifs sessions -s
cifs terminate -t 10
priv set advanced
cifs prefdc add <domain name> <ip address>
cifs prefdc add <pdc ip address>    (for a PDC)
cifs homedir load    # 7.0: load into the registry
cifs nbalias
cifs testdc
cifs domaininfo    (also check the /etc/rc file)
options cifs.trace_login on    (to trace cifs login issues)
CIFS performance
cifs stats
smb_hist -z
sysstat -c 15 2    (15 iterations every 2 seconds)
statit
WAFL_susp
ifstat -a
netstat -m / -r / -i    (any one can be used)
netdiag -v, -sap
cifs sessions
cifs performance optimization
options cifs.oplocks.enable on
options cifs.tcp_window_size 64240
options cifs.max_mpx 253
options cifs.neg_buf_size 65340    (max)
(32K + 260 = 33028; this number can also be set)
Check that switches enter forwarding mode immediately:
# set spantree portfast <module/port> enable
options cifs.oplocks_opendelta 0
(if clients disconnect too much after this change, change it back to 8 (the default))
CIFS home directory
1. a volume snapvol is created
2. a qtree is created at the root of this vol => snapvol; sec is unix
3. a share named snaphome is created on this qtree as
/vol/snapvol/home with Everyone / Full Control
4. options cifs.home_dir /vol/snapvol/home
options cifs.home_dir_namestyle <blank>
5. edit the /etc/cifs_homedir.cfg file and add at the end:
/vol/snapvol/home
CIFS troubleshooting NT4 domain
cifs setup error: the filer's security information differs from the domain controller; CIFS could not start
Sol:
NT4 PDC/BDC: in Server Manager, delete the account, recreate the account, and rerun cifs setup.
NT4 PDC and BDC secure channel communication/verification
BDC c:\> netdom bdc \\bdcname /query
CIFS troubleshooting
wcc -s domain\name    (windows - match with the
/etc/lclgroups.cfg file; any
changes here require a reboot)
wcc -u username    (unix)
cifs domaininfo    (tells the dns entry)
rdfile /etc/rc    (will have the dns info)
options wafl
should show the unix default user,
pcuser
/etc/usermap.cfg
/etc/passwd    - these two files are read the first time a user connects
Cannot Ping DNS server
A.
1. Enter the host address in DNS
2. Make sure there is no deny/untrusted entry in the /etc/rc file
3. Check FilerView -> Network -> DNS entry
4. If a qtree is created and shared for CIFS access, make sure the qtree settings are correct, otherwise you may get an access denied error
B.
1. Check the DNS servers; each must point to itself and must have at least the 4-5 AD services registered
C.
1. Check where DNS currently points:
Filer> priv set diag
Filer> registry walk auth
If cifs setup needs to be rerun, this registry tree can be deleted:
Filer> registry deltree auth
D.
net view \\filername should show all shares from the windows side, and cifs shares should show them from the filer side
But when the share is accessed from a windows machine, we get No Network Provider Present. Ping works, iscsi works, the iscsi drives are OK and can be accessed, but the cifs shares do not work. On the filer side we see 'Called name not present (0x82)'. cifs resetdc also gives the same message.
Check:
a. This is seen if the filer and the windows DC are rebooted at the same time because of a power failure. The filer needs to come up first and then the DC.
b. Make sure there is no virus-related activity going on on that host. A virus scan of the windows host or the filer can also make this happen.
When a trust is in place
When a trust is newly established, a No Logon Server Available error may come up while accessing shares.
cifs resetdc will make it work; the same applies to some permission issues.
Disable WINS on interface e0 (if required to go by pure DNS only):
Filer> ifconfig e0 -WINS
(so that the filer does not talk to the WINS server)
Process to find CIFs problem
cifs shares    (should see Everyone / Full Control)
qtree security    (NTFS)
Check options wafl:
< > blank
< > unix
< > pcuser
Check /etc/usermap.cfg
and the /etc/passwd file
/vol/test    (check whether this is UNIX or NTFS)
When WINS address is changed
options cifs.wins_servers    (ip address, , ,)    (to view the WINS servers)
cifs resetdc
Common Cifs issues - cannot access , access denied
1. time lag between the pc and the filer (change it from FilerView)
2. qtree security [unix | ntfs | mixed] - change temporarily
from ntfs to unix and
back to ntfs, or ntfs to
mixed and back to ntfs
(when the folder is mapped, you do not see the security tab on its drive letter as well)
Cifs Options
options cifs.show_snapshot on
options cifs.netbios_aliases    (alternate names of the
filer)
options wafl.nt_admin_priv_map_to_root on*
options cifs.trace_login on
* so that a top-level windows administrator can take ownership of a file that was created from the unix side and has only unix ACLs
CIFS + NFS both
Scenario A
1. a qtree in a vol is created with mixed sec
2. share that qtree
3. group-wise user access in unix is defined in the /etc/group file
/etc/group -> is on the unix side (client or NIS server), e.g.:
eng::gid:khanna,uddhav
On the client side
ls -l    (file/directory listing)
chmod
chgrp
chown
(to see both permissions on cifs shares - the permission from unix and the permission from windows - use SecureShare Access)
4. In windows, create the group and give access
5. the /etc/usermap.cfg file is used to map user accounts in windows to their corresponding accounts in unix to access/manage resources (see the sketch after this scenario):
win          unix
<win name> <= <unix name>    (unix to windows)
<win name> => <unix name>    (windows to unix)
<win name> == <unix name>    (both directions)
test\* == <unix name>    (all users of the test windows domain)
domain\<user> => root
(if a user is not able to see their home directory but sees all other users' folders: cifs restart and access home)
7. Make sure that
Wafl.net_admin_priv_map_to_root ON
( sometimes permissions are locked and some files gets corrupted; while accessing it says do not have access or encrypted. Every other files works fine. In this case changing
Options cifs.nfs_root_ignore_acl from off to ON and
Change the permission from NFS mounted side -unix
to Chmod 777 and access file. Change back to OFF.
Will work after this all the time
(this was the cause when user upgraded from 6.4 to 6.5 and some files in mixed qtree’s folders were not able to access nor change the permission from even root user from NFS side. Above permission reset made it work.
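A small /etc/usermap.cfg sketch following the patterns above (the domain and account names are placeholders):
CORP\jsmith == jsmith             # same person, mapped in both directions
CORP\Administrator => root        # windows administrator acts as unix root
(after editing the file, reconnect or restart cifs so the new mappings are read)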
Scenario B
1. A qtree is created; its security is unix
2. A share is created of that qtree - so the location is the same
3. A cifs client cannot chdir into a directory if the user has only execute
permission - d-wx-wx-wx, e.g. MODE == 111. The user gets
an NT_STATUS_ACCESS_DENIED message on access
4. If the user is granted read only (MODE == 444), the chdir is
successful.
CIFS audit
options cifs.audit.enable on
options cifs.audit.file_access_events.enable on
options cifs.audit.logon_events.enable on
options cifs.audit.logsize 524288
options cifs.audit.saveas /etc/log/adtlog.evt
Filer> cifs audit save -f
Read /etc/log/adtlog.evt as an event log through windows
CIFS errors
LSAOpenPolicy2: Exception rpc_s_assoc_grp_max exceeded
Veritas Backup Exec 9.1: My Computer -> Shares -> Sessions shows Veritas Backup Exec administrative account connections for every share on the filer. One connection per share, and it grows each and every day as well as staying there each and every day. These must be wiped out.
Virus Scan
vscan    (to see the status of virus scanning)
vscan on
vscan off
vscan options
vscan scanners
vscan options client_msgbox [on|off]
vscan scanners secondary_scanners ip1 [ip address]
Fpolicy
Fpolicy show
Fpolicy enable
Fpolicy options
Fpolicy server
Quotas
rdfile /etc/quotas
Cluster Prerequisite
vol options <vol> create_ucode on
options coredump.timeout.enable on
options coredump.timeout.seconds 60 or less
Cluster
cf disable
cf enable
cf status
partner cifs terminate -t 0
cf giveback
Takeover sequence between heads F1 and F2:
F1> cf takeover    (F2 can then be shut down)
When F2 comes back up it shows "waiting for giveback
from partner"
F1> cf giveback
Sometimes, due to an active state, this may not run. Make sure that no cifs sessions are running; snapmirror should also be off
San FCP
Switch> cfgshow
>fcp show cfmode    (standby, partner, mixed)
>fcp set cfmode mixed
>fcp show adapters
>fcp show initiators
>fcp setup
>fcp set cfmode [dual_fabric | mixed | partner | standby]
>fcp nodename
>fcp config
>fcp status
>fcp start
>fcp config 10b
>igroup show
>fcp stats vtic
(vtic = virtual target interconnect adapter)
>fcp stats 5a
>sysstat -f 1
igroup show
lun show -m
lun show -v
/usr/sbin/lpfc/lputil    (to verify the bindings)
/opt/NTAPsanlun/bin/create_binding.pl -l root -n <filer ip>
/kernel/drv/sd.conf    (make sure the target id and adapters are here)
lputilnt    (utility from the NetApp windows host attach kit)
# sanlun lun show
# devfsadm    (to allow discovery of the new lun)
# newfs /dev/rdsk/c1t1d0s6
# sanlun fcp show adapter -v
# reboot -- -r
# sanlun
igroup bind <initiator group> <portset>
igroup unbind <initiator group>
OSSV and VSS
C:\> vssadmin list shadows
C:\> vssadmin list writers
C:\> vssadmin list providers
LUN
lun create
lun setup
lun show -m, -v
lun stats -a -o -i 2
lun destroy -f <lun path>    (the -f flag destroys the lun even if
it is mapped)
lun move
lun map | unmap <lun path> <initiator group> [<lun ID>]
lun online
priv set diag
lun geometry
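A concrete create/map sketch (the size, ostype, paths, and group name are placeholders):
filer> lun create -s 10g -t solaris /vol/vol1/qtree1/lun0
filer> igroup create -f -t solaris hostgrp1 <initiator WWPN>
filer> lun map /vol/vol1/qtree1/lun0 hostgrp1 0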
SNAP drive LUN creation process
1. create a qtree
2. share the qtree
3. create the lun - SnapDrive can be used - so that the lun is created inside the qtree
(if the qtree is not set up properly, cifs shares cannot be accessed - an access denied error message appears)
LUN restore from snapshot (snap restore of lun – snap restore licensing req )
Filer> snap restore -t file -s snap1 /vol/lunvol/lun1/lun1.lun
When the question is asked, choose Y; Proceed => Y
Filer> lun unmap <lun path> <initiator group>
Filer> lun map <lun path> <initiator group> [lun id]
Filer> lun online <lun path>
Space reservation for volumes, qtrees, and files is disabled by default; to change this we must do:
vol options vol1 create_reserved on | off
lun create -o noreserve -f    (overrides the default settings on the
volume, including the file level)
lun set reservation
Solaris lun increase
# dd if=/dev/zero of=/dev/rdsk/c1t0d1s2 count=1 bs=512
# format c1t0d1
Snapshot of LUN
.rws is the file created when a snapshot of a LUN is taken. Event ID 124 is generated by SnapDrive. When deletion of this snapshot LUN is tried, 134 is generated as well. When there is a busy snapshot, other snapshots may hang and 134 is also generated.
124 -> 249 -> 134 can be seen
(must see kb2370)
NDMP copy of LUN
ndmpcopy -da root:netapp /vol/vol0/lun/test.lun 10.1.1.1:/vol/vol0/lun/test.lun
(lun files can only be restored to either the root volume or qtree root directories)
(also, when the lun is copied it may not be full, so the copy may go fast)
After this - on the destination:
Dest filer> reallocate start -o -n <lun path>
LUN backup from snapmirrored volume
1. on both source and destination: create_ucode, convert_ucode ON
2. from the destination filer: snapmirror update [options] <dest_filer:dest_vol>
3. on the source filer: lun share <lun path> read
4. run the snapmirror update command
iSCSI
iscsi show interface
iscsi
fcp
iswt interface show iswta    (shows sessions and initiator information;
iswt = iscsi software target)
igroup
iscsi show initiator
iscsi stats
sysstat -i 1
iscsi config
iscsi status    (to make sure that iscsi is running; also check that
the iscsi license is enabled on the filer)
iscsi windows troubleshooting
iscsicli - command-line version from Microsoft
SuSe iscsi LUN setup – Chap authentication
1. filer> iscsi security generate
2. filer> iscsi security add -i <initiator> -s <method> -p <inpassword> -n <inname> [-o <outpassword> -m <outname>]
(for a particular initiator connection)
OR
Filer> iscsi security default -s <method> -p <inpassword> -n <inname> [-o <outpassword> -m <outname>]
(for any initiator connection) [[only this one works]]
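A concrete sketch of the per-initiator form (the initiator IQN and secrets are placeholders):
filer> iscsi security add -i iqn.1996-04.de.suse:01:abcdef123456 -s CHAP -p inSecret123 -n filerchap
The Linux side then needs the matching CHAP name/secret in its initiator config (e.g. /etc/iscsi.conf on older SuSE initiators).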
Troubleshooting
1. filer> iscsi config
2. linux # cat /etc/iscsi.conf
3. linux # cat /etc/fstab.iscsi
4. linux # uname -r
5. linux # cat /etc/issue
6. filer> iscsi show initiator
7. filer> iscsi security show
8. linux # cat /etc/initiatorname.iscsi
Iscsi private network connection
Filer> iswt interface show
Filer> iscsi show adapter
Filer> iswt session show -v iswta
(will show the tcp connection - the ip addresses)
Now, to change this to use the private network only:
a. SnapDrive -> iscsi management -> disconnect
b. from the filer, disable iscsi on the public nic:
iswt interface disable <interface>
c. then reconnect and use the private ip from SnapDrive
Space Reservation
df -r
.snapshot
Hourly snapshot create failed (kb 4764)
See Time Daemon under Filer General as well.
To see snapshots from a windows client,
check two things:
a. vol options vol0
nosnap = off, nosnapdir = off (the defaults)
These should be off; when they are turned ON, a cifs
windows client cannot access the snapshots to restore from them,
even though they can see them
b. options cifs
cifs.show_snapshot ON
To get access to \\<ip of filer>\vol0\.snapshot from a
windows cifs access host,
vol0 must be shared, otherwise
\vol0\.snapshot cannot be accessed
Nfs snapshot
.snapshot directory is shown only at the mount point, although it actually exists in every directory in the tree
Cifs snapshot
.snapshot directory appears only at the root of the share.
Priv set advanced
snap status
(blocks owned x 4K = space used)
snap list
(generally the snap reserve is 20%)
Solaris troubleshooting for a lun
solaris_info [-d <directory name>] [-n <name>]
SnapDrive troubleshooting tool
SnapDrvDc.exe
Snapdrive snapshot lun restore from mirror site
1. Break the mirror
2. Check that the lun is online
3. If using terminal services and you get the Failure in Checking Policies error, Error Code: 13040, then log off and log back in; if this does not work, reboot the windows host.
Single File Snap Restore (SFSR) is done before snapdrive makes the connection. During this time snapdrive virtually does not work and issues the 13040 error.
No other lun restore can be done from the same host. As SFSR is going on in the background, the sol is: wait patiently. Log off and log back in after a while; the drive should come back.
Snap restore
Volume Restore
snap restore -t vol path_and_vol_name
snap restore -t vol -s snapshot_name path_and_vol_name
File restore
snap restore -t file path_and_file_name
snap restore -t file -s snapshot_name -r new_path_and_file_name path_and_file_name
Snapshot restore
snap restore -t file -s winblocktemp /vol/winblocks/winblocksum
Qtree or directory restore
snap restore -f -t file -s <snapshot> /vol/vol0/<directory name>    (to restore a directory)
Vol
vol status -b
vol create vol1 2
vol restrict vol1
vol copy start vol0 vol1
vol online vol1
snap list vol1
… snapshot_for_volcopy.0
snap create vol1 snap1
Snap Mirror
/etc/snapmirror.conf
vol status -b vol1    (size in blocks)
vol status vol1
options snapmirror.access host=filerA
filerB> vol restrict vol2
>wrfile /etc/snapmirror.conf
filerA:vol1 filerB:vol2 - * * * *    (fields: min hour day-of-month day-of-week)
filerA:vol1 filerB:vol2 - 45 10,11,12,13,14,15,16 * 1,2,3,4,5    (at 45 past each hour, 10:00-16:00, Mon-Fri)
snapmirror on
vol status -v
filerB> snapmirror initialize -S filerA:vol1 filerB:vol2    # baseline data transfer
snapmirror status
snapmirror status -l    (more detailed info)
snapmirror off
snapmirror break filerB:vol2
snapmirror on
snapmirror quiesce filerB:/vol/vol0/mymirror    (quiesce before breaking a qtree snapmirror)
snapmirror resync -S filerB:vol2 filerA:vol1
----
snapmirror update filerB:vol2
snapmirror off    # disable snapmirror
snapmirror on    # resume snapmirror, reread /etc/snapmirror.conf
snapmirror break vol2    # converts a mirror to a read/write vol or qtree on the dest filer
snapmirror destinations -s source_volname
snapmirror release vol1 filerC:vol1
snapmirror status -l vol1
for a qtree:
snapmirror quiesce mirror_qtree
snapmirror break mirror_qtree
Breaking snapmirror
1. snapmirror quiesce <destination path>    (check the path in the
/etc/snapmirror.conf
file)
2. snapmirror off
3. snapmirror break <destination path>
To resume the operation
you have to resync
snapmirror store    # initialize a volume snapmirror from tape on the source vol
snapmirror retrieve    # on the mirror vol
Synchronous Snapmirror
/etc/snapmirror.conf
filera:/vol1 filerb:/vol2 - sync
# multipath
src_con = multi()
src_con:/vol1 dest:/vol2 - sync
# src_con = failover()
Steps to create Mirror
1. Enter the license on both filers
2. use the snapmirror.access option to specify the destination filer
3. on the destination filer, edit the /etc/snapmirror.conf file
4. on both the source and destination filers, enter the snapmirror on
command
5. on the destination filer, run the snapmirror initialize <destination> command
Requirements
The destination vol must be restricted
Everything in the destination will get deleted once it is initialized
snapmirror optimization
filer> options snapmirror.window_size 199475234    (the default)
(this causes large bursts of packets and does not work
well over a WAN; it may cause large packet drops, resulting
in the termination of the snapmirror transfer or in
very low throughput)
To change this
Dest filer> options snapmirror.window_size <between 8760
and 199475234>
Window size calculation
window size (bytes) = round-trip delay (sec) x desired throughput (bits/sec) / 8
Eg: for 10 mbps and an RTD of 100 millisec
(100/1000) x 10,000,000 / 8 = 125,000 bytes
options snapmirror.delayed_acks.enable off
Snapmirror problem
On the source filer:
snapmirror: source transfer from <vol> to <destination filer>:<vol>: request denied, previous request still pending
socket connect error: resource temporarily unavailable
Sol: on the destination
1. make sure that the vol is there
2. make sure the source is pingable
Destination mirror filer> snapmirror abort netapp01:vol1 pcnetapp01:vol1 OR snapmirror abort netapp01 pcnetapp01
Destination filer> snapmirror status
(see that the transfer has stopped)
Destination filer> snapmirror resync -S netapp01:vol1 pcnetapp01:vol1
Snapvault
>options snapvault.enable on
>options snapvault.access host=name
To baseline a qtree:
>snapvault start -S filer:/vol/vol1/c1-v1-q1 vault:/vol/volx/t1-v1-q1
>snapvault modify -S src_filer:qtree_path
>snapvault update dest
>snapvault snap sched <volume> <snapname> count@day_list@hour_list
>snapvault snap sched vol1 sv_1900 4@mon-sun@19
>snapvault snap unsched
>snapvault snap create    # manually create a snapshot on the primary or secondary
# (the snapshot name must exist)
>snapvault restore -s snapname -S srcfiler:/vol/volx/qtree
>snapvault stop destfiler:/vol/volx/qtree
>snapvault abort dest_qtree
>snapvault release src_qtree dest_qtree
>snapvault status
>snapvault start -r <source qtree> <destination qtree>
Snapvault troubleshooting
If a backup relationship from OSSV is created and then deleted from the secondary, any attempt to recreate it fails with the error message:
"Transfer aborted: the qtree is not the source for the snapmirror destination"
Example
Twain*> snapvault start -S fsr-pc1:E:\smalldir /vol/tiny/smalldir
(error at the console: snapvault: destination transfer from fsr-pc1:E:\smalldir to /vol/tiny/smalldir: the qtree is not the source for the snapmirror destination
Transfer aborted: the qtree is not the source for the snapmirror destination)
On the primary log: error: E:\smalldir twain:/vol/tiny/smalldir Invalid qtree/snapshot requested
Log: e:\smalldir twain:/vol/tiny/smalldir unexpected close getting qsm data
To work around it
Release the relationship on the primary using
snapvault.exe release e:\smalldir twain:/vol/tiny/smalldir
To see which relationships are releasable, type
snapvault.exe destinations
backup with DFM
>options ndmpd.enable on
>options ndmpd.access dfm-host
> options ndmpd.authtype <challenge | plaintext >
For a non-root user, set the ndmp password with
> ndmpd password <user name>
add the snapvault license
>options snapvault.enable on
>options snapvault.access host
>options ndmpd.preferred_interface e2    # optional
Importing an existing relationship: just add the relationship; the schedule/retention is not imported.
DFM and NDMP
a. First set this at the filer:
options ndmpd.enable on
options ndmpd.access <dfm host>
b. While DFM is downloaded and installed:
Primary storage system: Primary System Name
NDMP user <root>, if no other users are defined
NDMP p/w < >
c. Telnet to port 10,000 and make sure that it can talk and is not blocked.
Diagnosis between the DFM host and the filer, at the host where DFM is downloaded:
C:\> dfm host diag <primary filer>
DFM database files on the windows host machine
C:\> dfm database get
dbDir c:/Program files/Network Appliance/Data Fabric Manager/DFM/Data
dblogDir
dbCacheSize
Snaplock
vol create src1 -L 2    (at this point a question is asked - read it carefully; this volume cannot be deleted. It is one time)
vol create dst1 2
vol status    (you will see the snaplock compliance vol here)
snapmirror initialize -S giardia:src1 -L giardia:dst1
VIF create steps
a. vif create vif1 e0 e7a e6b e8    (single mode)
OR
vif create multi vif0 e4 e10    (multi mode)
b. ifconfig vif1 <ip of vif> netmask 255.255.255.0 mediatype 100tx-fd
c. update the /etc/rc file
d. reboot
Tip 1
check filer> routed status
filer> routed ON
Tip 2
If there are 3 ports (eg: 2 Gig and 1 100 Base-T Ethernet), e0 (default - 100 Base-T) must be turned off
Vfiler
The hosting filer administrator does not have CIFS or NFS access to the data contained in vfilers, except for that in vfiler0. After a storage unit is assigned to a vfiler, the hosting filer administrator loses access to that storage unit. The vfiler administrator gains access to the vfiler by rsh to the vfiler's IP address. As hosting filer administrator, before you create a vfiler with the /vol/vol1 volume, you can configure the /etc/exports file so that you can mount the /vol/vol1 volume. After you create the vfiler, an attempt to mount /vol/vol1 would result in the "Stale NFS file handle" error message. The vfiler administrator can then edit the vfiler's /etc/exports file to export /vol/vol1, run the exportfs -a command on the vfiler, then mount /vol/vol1, if allowed.
>ipspace create vfiler1-ipspace
>ipspace assign vfiler1 e3a
>ifconfig e3a 0.0.0.0
>ipspace destroy e3a_ipspace
>ipspace list
>vfiler create vfiler2 -s vfiler2 -i 10.41.66.132 /vol/vfiler/vfiler2
>vfiler status -a
>vfiler status -r    # running
>vfiler run vfiler1 setup
>vfiler stop|start|destroy
(does it need to be started after setup?)
VFM
Cache location
C:\Documents and Settings\All Users\Application Data\NuView\Storage\Server\Cache
To change the location of the VFM application directory <C:\Documents and Settings\All Users\Application Data\NuView\>, which contains the cache directory:
1. Take a snapshot of the application in case there is a need to return to the working state. This can be done through VFM in the Tools menu by selecting Take Application Snapshot Now. Have the user create a snapshot and save it.
2. Save a copy of the VFM application folder <C:\Documents and Settings\All Users\Application Data\NuView> somewhere for backup purposes.
3. Exit VFM and stop the StorageXReplicationAgent service and the StorageXServer service.
4. Create a folder on a different drive on the VFM server where the application directory should reside in the future. Please use a local destination for the folder, for example D:\VFMAppData; a mapped drive does not work in this situation. Create a new subdirectory called NuView in the new location. Ex: D:\VFMAppData\NuView
5. Go to the C:\Documents and Settings\All Users\Application Data\NuView directory and copy the StorageX directory to the new location created by the user under the NuView subdirectory. The new location should look something like this: D:\VFMAppData\NuView\StorageX
6. Open the registry with regedit.exe and find the HKEY_LOCAL_MACHINE\SOFTWARE\NuView\StorageX key. Add a new String Value here with the name AppDataDir and set the value data to the root of the new cache location. Ex: D:\VFMAppData
7. Close regedit and start the StorageX Server and Replication Agent services.
8. Start VFM and wait as it reads through the new cache directory and loads the roots and information that were copied to the new location.
Backup media fundamentals
ndmpd should be ON. To check:
Filer> ndmpd status
Filer> ndmpd probe 0    [session 0; can be from 0 - 5]
sysconfig -t    (will give some backup media information)
mt -f nrst0a status
restore tf nrst0a    (displays the file list; there can be multiple backups in a backup file, which is the filelist)
mt -f nrst0a fsf 1
storage disable adapter <port>
storage enable adapter <port>
storage show tape supported    (should show the wwn if supported)
(sysconfig -a will tell the port and also shows whether the adapter card is online or offline - usually slot 10)
/etc/log/backup    (log files)
List all the files in a backup
Filer> restore tf rst0a    (after a rewind of the tape)
Filer> mt -f rst0a fsf 6    (move the head to file list 6)
Filer> mt -f rst0a status    (make sure)
Filer> restore -xvbfDH 60 rst0a /vol/vol0/...    (restore)
Testing
dump 0f rst0a /vol/vol0/etc/usermap.cfg    (example)
SCSI tape diagnostics to send to the vendor (more detailed messages)
Filer> mt -f diag 1    (ON)
Filer> mt -f diag 0    (OFF)
Copy & paste the console messages and send them to the vendor (with diag 1 ON, all the messages go to /etc/messages whenever any backup job or command is executed, like mt -f rewind, offline, erase, status, diag, etc.)
Some issues
a. If Veritas is showing RED for the LTO tape devices, then reboot the LTO and restart the Veritas services
b. If backup is done from Veritas software, make sure that no sessions stay behind as cifs share sessions. Go to My Computer -> Manage -> connect to filer -> Shares -> Sessions. Administrative shares of backups are seen sticking here - not going away even after the backup is complete - and you see a huge list here.
Fibre channel backup device
Filer> fcadmin online adapter 8a
Filer> fcadmin online adapter 8b
Filer> fcp show adapter
filer> storage show tape
Tape Drive: FPN[200300051e35353d]:0.80
Description: HP Ultrium 2-SCSI
Serial Number: HUM2M00009
World Wide Name: WWN[5:006:0b0000:1e01ae]
Alias Name(s): st0
Device State: available (does not support reservations)
McData side
CNXNAS*> storage show switch
Switch: WWN[1:000:080088:020751]
Fabric: WWN[1:000:080088:020751]
Name: CNX01
Domain: 97
Type: switch