Web Content Extractor Command Line Options
It is possible to perform Web Content Extractor commands from the command line. To launch the program at a specified time, you can use "Windows Task Scheduler", or any other scheduler, controlling the launch parameters from command line. Possible key prefixes are "-" and "/".
Syntax: WCExtractor.exe ["projectfile"] [-dr] [-dt] [-rt] [-rft] [-at"filename"] [-s] [-ddr] [-fr] [-fr"column name{criteria index}value"] [-qe] [-qe"filename"][-ex]
Projectfile - the file name of the project (*.wcepr) to open.
Key |
Command |
-dr |
delete all results/records |
-dt |
delete all tasks |
-rt |
reset all tasks |
-rft |
reset failed tasks |
-at"filename" |
add new tasks from file, filename - name of the CSV or TXT file that contains URLs separated by newlines. |
-s |
start the extraction process |
-ddr |
delete duplicate records |
-fr |
filter results |
-fr"column name{criteria index}value" |
filter results, criteria index: 0 - contains, 1 - does not contain, 2 - equals, 3 - does not equal, 4 - begins with, 5 - does not begin with, 6 - ends with, 7 - does not end with, 8 - is larger than, 9 - is less than |
-qe |
export results/records |
-qe"filename" |
export results/records, filename - name of the output file. |
-ex |
exit when all tasks are done. |
Examples
To launch the program, then open the "myproject.wcepr" project file, delete all previous results, reset all tasks, start the extraction process, export data and close the program, you should use the following command:
"C:\Program Files\Web Content Extractor\WCExtractor.exe" "C:\Program Files\Web Content Extractor\myproject.wcepr" -dr -rt -s -qe -ex
To launch the program, then open the "myproject.wcepr" project file, delete all previous tasks, add new tasks from "urls.csv" file, start the extraction process, export data and close the program, you should use the following command:
"C:\Program Files\Web Content Extractor\WCExtractor.exe" "C:\Program Files\Web Content Extractor\myproject.wcepr" -dt -at"C:\Program Files\Web Content Extractor\urls.csv" -s -qe -ex
Note: The program will export data, using the export configuration, which was the last to be executed in the project. If the project has never been exported, then this function is not available.
