Copy files in parallel by reading what files to copy from few other files

Clash Royale CLAN TAG#URR8PPP
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;
up vote
7
down vote
favorite
I am working on a little complicated shell script for the first time and this is what it should do:
- During startup, it figures out what is my
clientidby looking athost-mapping.txtfile. If I cannot findclientidfor my hostname then I need to exit from the shell script with a non-zero status code and log error message. - Now once I have a valid
clientid, I will extract primary files from theprimary-mappings.txtfile and secondary files from thesecondary-mappings.txtfile for that validclientid. If for whatever reason, I cannot find either primary or secondary files for thatclientidfrom that file, then I will exit from the shell script and log an error message. - Now once I have valid primary and secondary files for that
clientidthen I will start copying those files in parallel usinggnu-parallelfromlocal_server. All primary files will go toprimaryfolder and all secondary files will go tosecondaryfolder. If files are not there inhold1folder on remote servers then it should be there in thehold2folder. - Now once all the files are copied, I will verify at the end to make sure all the primary and secondary files are present for that
clientidin those two folders, but if for whatever reason I cannot find those files, then I want to exit from the shell script with a message that tells me what files are missing.
My script does the job but I would like to see if there is any better or efficient way to do these things since this is my first time writing a little complicated script so I wanted to check this out. As of now, I don't have a mechanism to exit out of the shell script if I cannot find primary or secondary files for that clientid. I also don't have a mechanism to exit out of the shell script if during the verification phase some files are missing.
#!/bin/bash
path=/home/goldy/scripts
mapfiles=(primary-mappings.txt secondary-mappings.txt)
hostfile=host-mapping.txt
machines=(machine1769.abc.host.com proctek5461.def.host.com letyrs87541.pqr.host.com)
# folders on local box where to copy files
primary=/data01/primary
secondary=/data02/secondary
# folders on remote servers from where to copy files
export hold1=/data/snapshot/$1
export hold2=/data/snapshot/$2
date1=$(date +"%s")
# this will tell me what's my clientid given my current hostname
getProperty () grep "$prop_value"
# if I can't find clientid for my hostname, then I will log a message
# and exit out of shell script with non zero status code
clientid=$(getProperty)
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)"; exit 1;
# now once I have valid clientid, then I will get primary and secondary mapping
# from the "host-mapping.txt" file
declare -a arr
mappingsByClientID ()
id=$1 # 1 to 5
file=$path/$mapfiles[$2] # 0 to 1
arr=($(sed -r "s/.*b$id=[([^]]+).*/1/; s/,/ /g" $file))
echo "$arr[@]"
# assign output of function to an array
pri=($(mappingsByClientID $clientid 0))
snd=($(mappingsByClientID $clientid 1))
echo "primary files: $pri[@]"
echo "secondary files: $snd[@]"
# figure out which machine you want to use to start copying files from
case $(hostname -f) in
*abc.host.com)
local_server=("$machines[0]")
;;
*def.host.com)
local_server=("$machines[1]")
;;
*pqr.host.com)
local_server=("$machines[2]")
;;
*) echo "unknown host: $(hostname -f), exiting." && exit 1 ;;
# ?
esac
export local="$local_server"
# deleting files before we start copying
find "$primary" -maxdepth 1 -type f -exec rm -fv ;
find "$secondary" -maxdepth 1 -type f -exec rm -fv ;
do_copy() (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
export -f do_copy
# copy files in parallel
parallel -j "$3" do_copy $primary ::: $pri[@] &
parallel -j "$3" do_copy $secondary ::: $snd[@] &
wait
echo "all files copied"
# this is for verification to see all files got copied or not
# in primary and secondary folder
set -- "$primary" "$secondary"
typeset -n array
for array in pri snd; do
for num in "$array[@]"; do
name="hello_monthly_$num_999_1.data"
if [ ! -f "$1/$name" ]; then
echo "$name" not found in "$1" >&2 && exit 1;
fi
done
shift
done
date2=$(date +"%s")
diff=$(($date2-$date1))
echo "Total Time Taken - $(($diff / 3600)) hours and $(((diff/60) % 60)) minutes and $(($diff % 60)) seconds elapsed."
Here is my host-mapping.txt file that will have lot more entries. Here, value is a valid hostname and key will be a string "k" followed by some number and that number should be there in the mapping files.
k1=machineA.abc.com
k2=machineB.abc.com
k3=machineC.def.com
k4=machineD.pqr.com
k5=machineO.abc.com
And here is my sample mapping files:
primary_mappings.txt
1=[343, 0, 686, 1372, 882, 196], 2=[687, 1, 1373, 883, 197, 736, 1030, 1569], 3=[1374, 2, 884, 737, 198, 1570], 4=[1375, 1032, 1424, 3, 885, 1228], 5=[1033, 1425, 4, 200, 886]
secondary_mappings.txt
1=[1152, 816, 1488, 336, 1008], 2=[1153, 0, 817, 337, 1489, 1009, 1297], 3=[1, 1154, 1490, 338], 4=[1155, 2, 339, 1491, 819, 1299, 1635], 5=[820, 1492, 340, 3, 1156]
For example: clientid 1 has 343, 0, 686, 1372, 882, 196 primary files and 1152, 816, 1488, 336, 1008 secondary files, similarly for other clientids as well.
bash linux shell
add a comment |Â
up vote
7
down vote
favorite
I am working on a little complicated shell script for the first time and this is what it should do:
- During startup, it figures out what is my
clientidby looking athost-mapping.txtfile. If I cannot findclientidfor my hostname then I need to exit from the shell script with a non-zero status code and log error message. - Now once I have a valid
clientid, I will extract primary files from theprimary-mappings.txtfile and secondary files from thesecondary-mappings.txtfile for that validclientid. If for whatever reason, I cannot find either primary or secondary files for thatclientidfrom that file, then I will exit from the shell script and log an error message. - Now once I have valid primary and secondary files for that
clientidthen I will start copying those files in parallel usinggnu-parallelfromlocal_server. All primary files will go toprimaryfolder and all secondary files will go tosecondaryfolder. If files are not there inhold1folder on remote servers then it should be there in thehold2folder. - Now once all the files are copied, I will verify at the end to make sure all the primary and secondary files are present for that
clientidin those two folders, but if for whatever reason I cannot find those files, then I want to exit from the shell script with a message that tells me what files are missing.
My script does the job but I would like to see if there is any better or efficient way to do these things since this is my first time writing a little complicated script so I wanted to check this out. As of now, I don't have a mechanism to exit out of the shell script if I cannot find primary or secondary files for that clientid. I also don't have a mechanism to exit out of the shell script if during the verification phase some files are missing.
#!/bin/bash
path=/home/goldy/scripts
mapfiles=(primary-mappings.txt secondary-mappings.txt)
hostfile=host-mapping.txt
machines=(machine1769.abc.host.com proctek5461.def.host.com letyrs87541.pqr.host.com)
# folders on local box where to copy files
primary=/data01/primary
secondary=/data02/secondary
# folders on remote servers from where to copy files
export hold1=/data/snapshot/$1
export hold2=/data/snapshot/$2
date1=$(date +"%s")
# this will tell me what's my clientid given my current hostname
getProperty () grep "$prop_value"
# if I can't find clientid for my hostname, then I will log a message
# and exit out of shell script with non zero status code
clientid=$(getProperty)
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)"; exit 1;
# now once I have valid clientid, then I will get primary and secondary mapping
# from the "host-mapping.txt" file
declare -a arr
mappingsByClientID ()
id=$1 # 1 to 5
file=$path/$mapfiles[$2] # 0 to 1
arr=($(sed -r "s/.*b$id=[([^]]+).*/1/; s/,/ /g" $file))
echo "$arr[@]"
# assign output of function to an array
pri=($(mappingsByClientID $clientid 0))
snd=($(mappingsByClientID $clientid 1))
echo "primary files: $pri[@]"
echo "secondary files: $snd[@]"
# figure out which machine you want to use to start copying files from
case $(hostname -f) in
*abc.host.com)
local_server=("$machines[0]")
;;
*def.host.com)
local_server=("$machines[1]")
;;
*pqr.host.com)
local_server=("$machines[2]")
;;
*) echo "unknown host: $(hostname -f), exiting." && exit 1 ;;
# ?
esac
export local="$local_server"
# deleting files before we start copying
find "$primary" -maxdepth 1 -type f -exec rm -fv ;
find "$secondary" -maxdepth 1 -type f -exec rm -fv ;
do_copy() (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
export -f do_copy
# copy files in parallel
parallel -j "$3" do_copy $primary ::: $pri[@] &
parallel -j "$3" do_copy $secondary ::: $snd[@] &
wait
echo "all files copied"
# this is for verification to see all files got copied or not
# in primary and secondary folder
set -- "$primary" "$secondary"
typeset -n array
for array in pri snd; do
for num in "$array[@]"; do
name="hello_monthly_$num_999_1.data"
if [ ! -f "$1/$name" ]; then
echo "$name" not found in "$1" >&2 && exit 1;
fi
done
shift
done
date2=$(date +"%s")
diff=$(($date2-$date1))
echo "Total Time Taken - $(($diff / 3600)) hours and $(((diff/60) % 60)) minutes and $(($diff % 60)) seconds elapsed."
Here is my host-mapping.txt file that will have lot more entries. Here, value is a valid hostname and key will be a string "k" followed by some number and that number should be there in the mapping files.
k1=machineA.abc.com
k2=machineB.abc.com
k3=machineC.def.com
k4=machineD.pqr.com
k5=machineO.abc.com
And here is my sample mapping files:
primary_mappings.txt
1=[343, 0, 686, 1372, 882, 196], 2=[687, 1, 1373, 883, 197, 736, 1030, 1569], 3=[1374, 2, 884, 737, 198, 1570], 4=[1375, 1032, 1424, 3, 885, 1228], 5=[1033, 1425, 4, 200, 886]
secondary_mappings.txt
1=[1152, 816, 1488, 336, 1008], 2=[1153, 0, 817, 337, 1489, 1009, 1297], 3=[1, 1154, 1490, 338], 4=[1155, 2, 339, 1491, 819, 1299, 1635], 5=[820, 1492, 340, 3, 1156]
For example: clientid 1 has 343, 0, 686, 1372, 882, 196 primary files and 1152, 816, 1488, 336, 1008 secondary files, similarly for other clientids as well.
bash linux shell
add a comment |Â
up vote
7
down vote
favorite
up vote
7
down vote
favorite
I am working on a little complicated shell script for the first time and this is what it should do:
- During startup, it figures out what is my
clientidby looking athost-mapping.txtfile. If I cannot findclientidfor my hostname then I need to exit from the shell script with a non-zero status code and log error message. - Now once I have a valid
clientid, I will extract primary files from theprimary-mappings.txtfile and secondary files from thesecondary-mappings.txtfile for that validclientid. If for whatever reason, I cannot find either primary or secondary files for thatclientidfrom that file, then I will exit from the shell script and log an error message. - Now once I have valid primary and secondary files for that
clientidthen I will start copying those files in parallel usinggnu-parallelfromlocal_server. All primary files will go toprimaryfolder and all secondary files will go tosecondaryfolder. If files are not there inhold1folder on remote servers then it should be there in thehold2folder. - Now once all the files are copied, I will verify at the end to make sure all the primary and secondary files are present for that
clientidin those two folders, but if for whatever reason I cannot find those files, then I want to exit from the shell script with a message that tells me what files are missing.
My script does the job but I would like to see if there is any better or efficient way to do these things since this is my first time writing a little complicated script so I wanted to check this out. As of now, I don't have a mechanism to exit out of the shell script if I cannot find primary or secondary files for that clientid. I also don't have a mechanism to exit out of the shell script if during the verification phase some files are missing.
#!/bin/bash
path=/home/goldy/scripts
mapfiles=(primary-mappings.txt secondary-mappings.txt)
hostfile=host-mapping.txt
machines=(machine1769.abc.host.com proctek5461.def.host.com letyrs87541.pqr.host.com)
# folders on local box where to copy files
primary=/data01/primary
secondary=/data02/secondary
# folders on remote servers from where to copy files
export hold1=/data/snapshot/$1
export hold2=/data/snapshot/$2
date1=$(date +"%s")
# this will tell me what's my clientid given my current hostname
getProperty () grep "$prop_value"
# if I can't find clientid for my hostname, then I will log a message
# and exit out of shell script with non zero status code
clientid=$(getProperty)
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)"; exit 1;
# now once I have valid clientid, then I will get primary and secondary mapping
# from the "host-mapping.txt" file
declare -a arr
mappingsByClientID ()
id=$1 # 1 to 5
file=$path/$mapfiles[$2] # 0 to 1
arr=($(sed -r "s/.*b$id=[([^]]+).*/1/; s/,/ /g" $file))
echo "$arr[@]"
# assign output of function to an array
pri=($(mappingsByClientID $clientid 0))
snd=($(mappingsByClientID $clientid 1))
echo "primary files: $pri[@]"
echo "secondary files: $snd[@]"
# figure out which machine you want to use to start copying files from
case $(hostname -f) in
*abc.host.com)
local_server=("$machines[0]")
;;
*def.host.com)
local_server=("$machines[1]")
;;
*pqr.host.com)
local_server=("$machines[2]")
;;
*) echo "unknown host: $(hostname -f), exiting." && exit 1 ;;
# ?
esac
export local="$local_server"
# deleting files before we start copying
find "$primary" -maxdepth 1 -type f -exec rm -fv ;
find "$secondary" -maxdepth 1 -type f -exec rm -fv ;
do_copy() (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
export -f do_copy
# copy files in parallel
parallel -j "$3" do_copy $primary ::: $pri[@] &
parallel -j "$3" do_copy $secondary ::: $snd[@] &
wait
echo "all files copied"
# this is for verification to see all files got copied or not
# in primary and secondary folder
set -- "$primary" "$secondary"
typeset -n array
for array in pri snd; do
for num in "$array[@]"; do
name="hello_monthly_$num_999_1.data"
if [ ! -f "$1/$name" ]; then
echo "$name" not found in "$1" >&2 && exit 1;
fi
done
shift
done
date2=$(date +"%s")
diff=$(($date2-$date1))
echo "Total Time Taken - $(($diff / 3600)) hours and $(((diff/60) % 60)) minutes and $(($diff % 60)) seconds elapsed."
Here is my host-mapping.txt file that will have lot more entries. Here, value is a valid hostname and key will be a string "k" followed by some number and that number should be there in the mapping files.
k1=machineA.abc.com
k2=machineB.abc.com
k3=machineC.def.com
k4=machineD.pqr.com
k5=machineO.abc.com
And here is my sample mapping files:
primary_mappings.txt
1=[343, 0, 686, 1372, 882, 196], 2=[687, 1, 1373, 883, 197, 736, 1030, 1569], 3=[1374, 2, 884, 737, 198, 1570], 4=[1375, 1032, 1424, 3, 885, 1228], 5=[1033, 1425, 4, 200, 886]
secondary_mappings.txt
1=[1152, 816, 1488, 336, 1008], 2=[1153, 0, 817, 337, 1489, 1009, 1297], 3=[1, 1154, 1490, 338], 4=[1155, 2, 339, 1491, 819, 1299, 1635], 5=[820, 1492, 340, 3, 1156]
For example: clientid 1 has 343, 0, 686, 1372, 882, 196 primary files and 1152, 816, 1488, 336, 1008 secondary files, similarly for other clientids as well.
bash linux shell
I am working on a little complicated shell script for the first time and this is what it should do:
- During startup, it figures out what is my
clientidby looking athost-mapping.txtfile. If I cannot findclientidfor my hostname then I need to exit from the shell script with a non-zero status code and log error message. - Now once I have a valid
clientid, I will extract primary files from theprimary-mappings.txtfile and secondary files from thesecondary-mappings.txtfile for that validclientid. If for whatever reason, I cannot find either primary or secondary files for thatclientidfrom that file, then I will exit from the shell script and log an error message. - Now once I have valid primary and secondary files for that
clientidthen I will start copying those files in parallel usinggnu-parallelfromlocal_server. All primary files will go toprimaryfolder and all secondary files will go tosecondaryfolder. If files are not there inhold1folder on remote servers then it should be there in thehold2folder. - Now once all the files are copied, I will verify at the end to make sure all the primary and secondary files are present for that
clientidin those two folders, but if for whatever reason I cannot find those files, then I want to exit from the shell script with a message that tells me what files are missing.
My script does the job but I would like to see if there is any better or efficient way to do these things since this is my first time writing a little complicated script so I wanted to check this out. As of now, I don't have a mechanism to exit out of the shell script if I cannot find primary or secondary files for that clientid. I also don't have a mechanism to exit out of the shell script if during the verification phase some files are missing.
#!/bin/bash
path=/home/goldy/scripts
mapfiles=(primary-mappings.txt secondary-mappings.txt)
hostfile=host-mapping.txt
machines=(machine1769.abc.host.com proctek5461.def.host.com letyrs87541.pqr.host.com)
# folders on local box where to copy files
primary=/data01/primary
secondary=/data02/secondary
# folders on remote servers from where to copy files
export hold1=/data/snapshot/$1
export hold2=/data/snapshot/$2
date1=$(date +"%s")
# this will tell me what's my clientid given my current hostname
getProperty () grep "$prop_value"
# if I can't find clientid for my hostname, then I will log a message
# and exit out of shell script with non zero status code
clientid=$(getProperty)
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)"; exit 1;
# now once I have valid clientid, then I will get primary and secondary mapping
# from the "host-mapping.txt" file
declare -a arr
mappingsByClientID ()
id=$1 # 1 to 5
file=$path/$mapfiles[$2] # 0 to 1
arr=($(sed -r "s/.*b$id=[([^]]+).*/1/; s/,/ /g" $file))
echo "$arr[@]"
# assign output of function to an array
pri=($(mappingsByClientID $clientid 0))
snd=($(mappingsByClientID $clientid 1))
echo "primary files: $pri[@]"
echo "secondary files: $snd[@]"
# figure out which machine you want to use to start copying files from
case $(hostname -f) in
*abc.host.com)
local_server=("$machines[0]")
;;
*def.host.com)
local_server=("$machines[1]")
;;
*pqr.host.com)
local_server=("$machines[2]")
;;
*) echo "unknown host: $(hostname -f), exiting." && exit 1 ;;
# ?
esac
export local="$local_server"
# deleting files before we start copying
find "$primary" -maxdepth 1 -type f -exec rm -fv ;
find "$secondary" -maxdepth 1 -type f -exec rm -fv ;
do_copy() (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
export -f do_copy
# copy files in parallel
parallel -j "$3" do_copy $primary ::: $pri[@] &
parallel -j "$3" do_copy $secondary ::: $snd[@] &
wait
echo "all files copied"
# this is for verification to see all files got copied or not
# in primary and secondary folder
set -- "$primary" "$secondary"
typeset -n array
for array in pri snd; do
for num in "$array[@]"; do
name="hello_monthly_$num_999_1.data"
if [ ! -f "$1/$name" ]; then
echo "$name" not found in "$1" >&2 && exit 1;
fi
done
shift
done
date2=$(date +"%s")
diff=$(($date2-$date1))
echo "Total Time Taken - $(($diff / 3600)) hours and $(((diff/60) % 60)) minutes and $(($diff % 60)) seconds elapsed."
Here is my host-mapping.txt file that will have lot more entries. Here, value is a valid hostname and key will be a string "k" followed by some number and that number should be there in the mapping files.
k1=machineA.abc.com
k2=machineB.abc.com
k3=machineC.def.com
k4=machineD.pqr.com
k5=machineO.abc.com
And here is my sample mapping files:
primary_mappings.txt
1=[343, 0, 686, 1372, 882, 196], 2=[687, 1, 1373, 883, 197, 736, 1030, 1569], 3=[1374, 2, 884, 737, 198, 1570], 4=[1375, 1032, 1424, 3, 885, 1228], 5=[1033, 1425, 4, 200, 886]
secondary_mappings.txt
1=[1152, 816, 1488, 336, 1008], 2=[1153, 0, 817, 337, 1489, 1009, 1297], 3=[1, 1154, 1490, 338], 4=[1155, 2, 339, 1491, 819, 1299, 1635], 5=[820, 1492, 340, 3, 1156]
For example: clientid 1 has 343, 0, 686, 1372, 882, 196 primary files and 1152, 816, 1488, 336, 1008 secondary files, similarly for other clientids as well.
bash linux shell
edited Apr 18 at 3:50
Jamalâ¦
30.1k11114225
30.1k11114225
asked Apr 18 at 2:38
user1950349
1511418
1511418
add a comment |Â
add a comment |Â
2 Answers
2
active
oldest
votes
up vote
4
down vote
Here's a useless use of cat:
prop_key=`cat $path/$hostfile | grep "$prop_value" | cut -d'=' -f1`
That could easily be
prop_key=$(grep -e "$prop_value" -- "$path/$hostfile" | cut -d= -f1)
(I've fixed your quoting there, too)
Send error messages to the standard error stream, not standard output:
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)" >&2; exit 1;
I added >&2 there - it's worth making this into a function:
die()
echo "$@" >&2
exit 1
You could insert logger -t "$0" "$*" in there if you wanted, for example.
This long line is hard to follow:
(scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold1"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1) || (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
I'm not sure why it needs to be in a sub-shell, but assuming that's necessary, we can redirect the stdout and stderr of the entire sub-shell, which will shorten command. It might be worth identifying what changes between the two invocations with a simple for loop:
scp_opts=(C -o StrictHostKeyChecking=no)
(for hold in "$hold1" "$hold2"
do scp "$scp_opts[@]"
"goldy@$local:$hold/hello_monthly_$1_999_1.data" "$2/."
&& break
done) >/dev/null 2>&1
I've just used $1 and $2 directly - but if you want to give them names, I recommend declaring them both as local:
local el primsec
For the final formatting of seconds into hours, minutes and seconds, you can use date again (as you seem to be assuming GNU date), if the duration will be less than 24 hours:
date -d @$diff '+Total Time Taken - %-H hours and %-M minutes and %-S seconds elapsed.'
That still suffers from the "1 seconds" problem. You might prefer a simple numeric form:
date -d @$diff '+Total Time Taken - %T'
If you're justgreping a file why use redirection?
â chicks
Apr 23 at 13:49
Thanks @chicks - I did mean to letgrepopen its own file. I've edited accordingly.
â Toby Speight
Apr 23 at 13:50
I recommendgrep -e "$prop_value" -- "$path/$hostfile"just in case$prop_valueor$pathstarts with a hyphen.
â Gareth Rees
Jun 1 at 9:25
add a comment |Â
up vote
3
down vote
good
- decent use of comments
- using shell functions
- good indentation
- nice output to keep the user informed
- decent variable names
- not too complicated for a shell script. And the complexity is reasonably well compartmentalized.
quick hints
- use https://www.shellcheck.net/ to get helpful suggestions. In your case it is mostly just suggesting to use quotes in a few more places and skip the
``(backticks) form of command substitution. Since you're already using the more modern$()form of command substitution that will help make the code more consistent. - line comments usually look better with a blank line above them. Also, it'd be nice to comment the initial lines 2-11 as "defaults" or something.
- usually shell functions are grouped together near the top of the script right after the variable assignments. If you want they could be in a separate file that gets
sourced. - since you only call
getPropertyonce to get theclientidwhy not include the error checking portion of things inside the function?
better output and logging
You mention wanting to log error messages, but I don't see anything that write outs to a log. This is easy enough to do with something like:
LOGFILE=/tmp/myscript.log
myerror()
local msg="$1"
echo "$(date) $msg" >> $LOGFILE
echo "$msg" > &2 # stderr
exit 1
# sometime later...
myerror "something weird happened"
So, this will take your error message, display it to the user, and append it to a logfile, before exiting. You could easily extend this to include the PID ($$). If you like having the date/time in your output you could also create a date_echo or something that takes your message and puts the date on front of it.
date_echo()
local msg="$1"
# echo "$(date) $msg" >> $LOGFILE
echo "$(date): $msg"
I agreelocalwould be an improvement so I changed that. I'm working on thestderrsuggestion.
â chicks
Apr 23 at 13:45
add a comment |Â
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
4
down vote
Here's a useless use of cat:
prop_key=`cat $path/$hostfile | grep "$prop_value" | cut -d'=' -f1`
That could easily be
prop_key=$(grep -e "$prop_value" -- "$path/$hostfile" | cut -d= -f1)
(I've fixed your quoting there, too)
Send error messages to the standard error stream, not standard output:
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)" >&2; exit 1;
I added >&2 there - it's worth making this into a function:
die()
echo "$@" >&2
exit 1
You could insert logger -t "$0" "$*" in there if you wanted, for example.
This long line is hard to follow:
(scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold1"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1) || (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
I'm not sure why it needs to be in a sub-shell, but assuming that's necessary, we can redirect the stdout and stderr of the entire sub-shell, which will shorten command. It might be worth identifying what changes between the two invocations with a simple for loop:
scp_opts=(C -o StrictHostKeyChecking=no)
(for hold in "$hold1" "$hold2"
do scp "$scp_opts[@]"
"goldy@$local:$hold/hello_monthly_$1_999_1.data" "$2/."
&& break
done) >/dev/null 2>&1
I've just used $1 and $2 directly - but if you want to give them names, I recommend declaring them both as local:
local el primsec
For the final formatting of seconds into hours, minutes and seconds, you can use date again (as you seem to be assuming GNU date), if the duration will be less than 24 hours:
date -d @$diff '+Total Time Taken - %-H hours and %-M minutes and %-S seconds elapsed.'
That still suffers from the "1 seconds" problem. You might prefer a simple numeric form:
date -d @$diff '+Total Time Taken - %T'
If you're justgreping a file why use redirection?
â chicks
Apr 23 at 13:49
Thanks @chicks - I did mean to letgrepopen its own file. I've edited accordingly.
â Toby Speight
Apr 23 at 13:50
I recommendgrep -e "$prop_value" -- "$path/$hostfile"just in case$prop_valueor$pathstarts with a hyphen.
â Gareth Rees
Jun 1 at 9:25
add a comment |Â
up vote
4
down vote
Here's a useless use of cat:
prop_key=`cat $path/$hostfile | grep "$prop_value" | cut -d'=' -f1`
That could easily be
prop_key=$(grep -e "$prop_value" -- "$path/$hostfile" | cut -d= -f1)
(I've fixed your quoting there, too)
Send error messages to the standard error stream, not standard output:
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)" >&2; exit 1;
I added >&2 there - it's worth making this into a function:
die()
echo "$@" >&2
exit 1
You could insert logger -t "$0" "$*" in there if you wanted, for example.
This long line is hard to follow:
(scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold1"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1) || (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
I'm not sure why it needs to be in a sub-shell, but assuming that's necessary, we can redirect the stdout and stderr of the entire sub-shell, which will shorten command. It might be worth identifying what changes between the two invocations with a simple for loop:
scp_opts=(C -o StrictHostKeyChecking=no)
(for hold in "$hold1" "$hold2"
do scp "$scp_opts[@]"
"goldy@$local:$hold/hello_monthly_$1_999_1.data" "$2/."
&& break
done) >/dev/null 2>&1
I've just used $1 and $2 directly - but if you want to give them names, I recommend declaring them both as local:
local el primsec
For the final formatting of seconds into hours, minutes and seconds, you can use date again (as you seem to be assuming GNU date), if the duration will be less than 24 hours:
date -d @$diff '+Total Time Taken - %-H hours and %-M minutes and %-S seconds elapsed.'
That still suffers from the "1 seconds" problem. You might prefer a simple numeric form:
date -d @$diff '+Total Time Taken - %T'
If you're justgreping a file why use redirection?
â chicks
Apr 23 at 13:49
Thanks @chicks - I did mean to letgrepopen its own file. I've edited accordingly.
â Toby Speight
Apr 23 at 13:50
I recommendgrep -e "$prop_value" -- "$path/$hostfile"just in case$prop_valueor$pathstarts with a hyphen.
â Gareth Rees
Jun 1 at 9:25
add a comment |Â
up vote
4
down vote
up vote
4
down vote
Here's a useless use of cat:
prop_key=`cat $path/$hostfile | grep "$prop_value" | cut -d'=' -f1`
That could easily be
prop_key=$(grep -e "$prop_value" -- "$path/$hostfile" | cut -d= -f1)
(I've fixed your quoting there, too)
Send error messages to the standard error stream, not standard output:
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)" >&2; exit 1;
I added >&2 there - it's worth making this into a function:
die()
echo "$@" >&2
exit 1
You could insert logger -t "$0" "$*" in there if you wanted, for example.
This long line is hard to follow:
(scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold1"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1) || (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
I'm not sure why it needs to be in a sub-shell, but assuming that's necessary, we can redirect the stdout and stderr of the entire sub-shell, which will shorten command. It might be worth identifying what changes between the two invocations with a simple for loop:
scp_opts=(C -o StrictHostKeyChecking=no)
(for hold in "$hold1" "$hold2"
do scp "$scp_opts[@]"
"goldy@$local:$hold/hello_monthly_$1_999_1.data" "$2/."
&& break
done) >/dev/null 2>&1
I've just used $1 and $2 directly - but if you want to give them names, I recommend declaring them both as local:
local el primsec
For the final formatting of seconds into hours, minutes and seconds, you can use date again (as you seem to be assuming GNU date), if the duration will be less than 24 hours:
date -d @$diff '+Total Time Taken - %-H hours and %-M minutes and %-S seconds elapsed.'
That still suffers from the "1 seconds" problem. You might prefer a simple numeric form:
date -d @$diff '+Total Time Taken - %T'
Here's a useless use of cat:
prop_key=`cat $path/$hostfile | grep "$prop_value" | cut -d'=' -f1`
That could easily be
prop_key=$(grep -e "$prop_value" -- "$path/$hostfile" | cut -d= -f1)
(I've fixed your quoting there, too)
Send error messages to the standard error stream, not standard output:
[ -z "$clientid" ] && echo "cannot find clientid for $(hostname -f)" >&2; exit 1;
I added >&2 there - it's worth making this into a function:
die()
echo "$@" >&2
exit 1
You could insert logger -t "$0" "$*" in there if you wanted, for example.
This long line is hard to follow:
(scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold1"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1) || (scp -C -o StrictHostKeyChecking=no goldy@"$local":"$hold2"/hello_monthly_"$el"_999_1.data "$primsec"/. > /dev/null 2>&1)
I'm not sure why it needs to be in a sub-shell, but assuming that's necessary, we can redirect the stdout and stderr of the entire sub-shell, which will shorten command. It might be worth identifying what changes between the two invocations with a simple for loop:
scp_opts=(C -o StrictHostKeyChecking=no)
(for hold in "$hold1" "$hold2"
do scp "$scp_opts[@]"
"goldy@$local:$hold/hello_monthly_$1_999_1.data" "$2/."
&& break
done) >/dev/null 2>&1
I've just used $1 and $2 directly - but if you want to give them names, I recommend declaring them both as local:
local el primsec
For the final formatting of seconds into hours, minutes and seconds, you can use date again (as you seem to be assuming GNU date), if the duration will be less than 24 hours:
date -d @$diff '+Total Time Taken - %-H hours and %-M minutes and %-S seconds elapsed.'
That still suffers from the "1 seconds" problem. You might prefer a simple numeric form:
date -d @$diff '+Total Time Taken - %T'
edited Jun 1 at 9:26
answered Apr 23 at 8:44
Toby Speight
17.5k13489
17.5k13489
If you're justgreping a file why use redirection?
â chicks
Apr 23 at 13:49
Thanks @chicks - I did mean to letgrepopen its own file. I've edited accordingly.
â Toby Speight
Apr 23 at 13:50
I recommendgrep -e "$prop_value" -- "$path/$hostfile"just in case$prop_valueor$pathstarts with a hyphen.
â Gareth Rees
Jun 1 at 9:25
add a comment |Â
If you're justgreping a file why use redirection?
â chicks
Apr 23 at 13:49
Thanks @chicks - I did mean to letgrepopen its own file. I've edited accordingly.
â Toby Speight
Apr 23 at 13:50
I recommendgrep -e "$prop_value" -- "$path/$hostfile"just in case$prop_valueor$pathstarts with a hyphen.
â Gareth Rees
Jun 1 at 9:25
If you're just
greping a file why use redirection?â chicks
Apr 23 at 13:49
If you're just
greping a file why use redirection?â chicks
Apr 23 at 13:49
Thanks @chicks - I did mean to let
grep open its own file. I've edited accordingly.â Toby Speight
Apr 23 at 13:50
Thanks @chicks - I did mean to let
grep open its own file. I've edited accordingly.â Toby Speight
Apr 23 at 13:50
I recommend
grep -e "$prop_value" -- "$path/$hostfile" just in case $prop_value or $path starts with a hyphen.â Gareth Rees
Jun 1 at 9:25
I recommend
grep -e "$prop_value" -- "$path/$hostfile" just in case $prop_value or $path starts with a hyphen.â Gareth Rees
Jun 1 at 9:25
add a comment |Â
up vote
3
down vote
good
- decent use of comments
- using shell functions
- good indentation
- nice output to keep the user informed
- decent variable names
- not too complicated for a shell script. And the complexity is reasonably well compartmentalized.
quick hints
- use https://www.shellcheck.net/ to get helpful suggestions. In your case it is mostly just suggesting to use quotes in a few more places and skip the
``(backticks) form of command substitution. Since you're already using the more modern$()form of command substitution that will help make the code more consistent. - line comments usually look better with a blank line above them. Also, it'd be nice to comment the initial lines 2-11 as "defaults" or something.
- usually shell functions are grouped together near the top of the script right after the variable assignments. If you want they could be in a separate file that gets
sourced. - since you only call
getPropertyonce to get theclientidwhy not include the error checking portion of things inside the function?
better output and logging
You mention wanting to log error messages, but I don't see anything that write outs to a log. This is easy enough to do with something like:
LOGFILE=/tmp/myscript.log
myerror()
local msg="$1"
echo "$(date) $msg" >> $LOGFILE
echo "$msg" > &2 # stderr
exit 1
# sometime later...
myerror "something weird happened"
So, this will take your error message, display it to the user, and append it to a logfile, before exiting. You could easily extend this to include the PID ($$). If you like having the date/time in your output you could also create a date_echo or something that takes your message and puts the date on front of it.
date_echo()
local msg="$1"
# echo "$(date) $msg" >> $LOGFILE
echo "$(date): $msg"
I agreelocalwould be an improvement so I changed that. I'm working on thestderrsuggestion.
â chicks
Apr 23 at 13:45
add a comment |Â
up vote
3
down vote
good
- decent use of comments
- using shell functions
- good indentation
- nice output to keep the user informed
- decent variable names
- not too complicated for a shell script. And the complexity is reasonably well compartmentalized.
quick hints
- use https://www.shellcheck.net/ to get helpful suggestions. In your case it is mostly just suggesting to use quotes in a few more places and skip the
``(backticks) form of command substitution. Since you're already using the more modern$()form of command substitution that will help make the code more consistent. - line comments usually look better with a blank line above them. Also, it'd be nice to comment the initial lines 2-11 as "defaults" or something.
- usually shell functions are grouped together near the top of the script right after the variable assignments. If you want they could be in a separate file that gets
sourced. - since you only call
getPropertyonce to get theclientidwhy not include the error checking portion of things inside the function?
better output and logging
You mention wanting to log error messages, but I don't see anything that write outs to a log. This is easy enough to do with something like:
LOGFILE=/tmp/myscript.log
myerror()
local msg="$1"
echo "$(date) $msg" >> $LOGFILE
echo "$msg" > &2 # stderr
exit 1
# sometime later...
myerror "something weird happened"
So, this will take your error message, display it to the user, and append it to a logfile, before exiting. You could easily extend this to include the PID ($$). If you like having the date/time in your output you could also create a date_echo or something that takes your message and puts the date on front of it.
date_echo()
local msg="$1"
# echo "$(date) $msg" >> $LOGFILE
echo "$(date): $msg"
I agreelocalwould be an improvement so I changed that. I'm working on thestderrsuggestion.
â chicks
Apr 23 at 13:45
add a comment |Â
up vote
3
down vote
up vote
3
down vote
good
- decent use of comments
- using shell functions
- good indentation
- nice output to keep the user informed
- decent variable names
- not too complicated for a shell script. And the complexity is reasonably well compartmentalized.
quick hints
- use https://www.shellcheck.net/ to get helpful suggestions. In your case it is mostly just suggesting to use quotes in a few more places and skip the
``(backticks) form of command substitution. Since you're already using the more modern$()form of command substitution that will help make the code more consistent. - line comments usually look better with a blank line above them. Also, it'd be nice to comment the initial lines 2-11 as "defaults" or something.
- usually shell functions are grouped together near the top of the script right after the variable assignments. If you want they could be in a separate file that gets
sourced. - since you only call
getPropertyonce to get theclientidwhy not include the error checking portion of things inside the function?
better output and logging
You mention wanting to log error messages, but I don't see anything that write outs to a log. This is easy enough to do with something like:
LOGFILE=/tmp/myscript.log
myerror()
local msg="$1"
echo "$(date) $msg" >> $LOGFILE
echo "$msg" > &2 # stderr
exit 1
# sometime later...
myerror "something weird happened"
So, this will take your error message, display it to the user, and append it to a logfile, before exiting. You could easily extend this to include the PID ($$). If you like having the date/time in your output you could also create a date_echo or something that takes your message and puts the date on front of it.
date_echo()
local msg="$1"
# echo "$(date) $msg" >> $LOGFILE
echo "$(date): $msg"
good
- decent use of comments
- using shell functions
- good indentation
- nice output to keep the user informed
- decent variable names
- not too complicated for a shell script. And the complexity is reasonably well compartmentalized.
quick hints
- use https://www.shellcheck.net/ to get helpful suggestions. In your case it is mostly just suggesting to use quotes in a few more places and skip the
``(backticks) form of command substitution. Since you're already using the more modern$()form of command substitution that will help make the code more consistent. - line comments usually look better with a blank line above them. Also, it'd be nice to comment the initial lines 2-11 as "defaults" or something.
- usually shell functions are grouped together near the top of the script right after the variable assignments. If you want they could be in a separate file that gets
sourced. - since you only call
getPropertyonce to get theclientidwhy not include the error checking portion of things inside the function?
better output and logging
You mention wanting to log error messages, but I don't see anything that write outs to a log. This is easy enough to do with something like:
LOGFILE=/tmp/myscript.log
myerror()
local msg="$1"
echo "$(date) $msg" >> $LOGFILE
echo "$msg" > &2 # stderr
exit 1
# sometime later...
myerror "something weird happened"
So, this will take your error message, display it to the user, and append it to a logfile, before exiting. You could easily extend this to include the PID ($$). If you like having the date/time in your output you could also create a date_echo or something that takes your message and puts the date on front of it.
date_echo()
local msg="$1"
# echo "$(date) $msg" >> $LOGFILE
echo "$(date): $msg"
edited Apr 23 at 13:47
answered Apr 23 at 3:00
chicks
1,3792818
1,3792818
I agreelocalwould be an improvement so I changed that. I'm working on thestderrsuggestion.
â chicks
Apr 23 at 13:45
add a comment |Â
I agreelocalwould be an improvement so I changed that. I'm working on thestderrsuggestion.
â chicks
Apr 23 at 13:45
I agree
local would be an improvement so I changed that. I'm working on the stderr suggestion.â chicks
Apr 23 at 13:45
I agree
local would be an improvement so I changed that. I'm working on the stderr suggestion.â chicks
Apr 23 at 13:45
add a comment |Â
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fcodereview.stackexchange.com%2fquestions%2f192343%2fcopy-files-in-parallel-by-reading-what-files-to-copy-from-few-other-files%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password