| 1 | +<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN" |
| 2 | + "http://www.w3.org/TR/html4/strict.dtd"> |
| 3 | +<!-- AUTOGENERATED FILE: ALL EDITS WILL BE LOST!!! --> |
| 4 | +<html lang="en"> |
| 5 | +<head> |
| 6 | +<meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> |
| 7 | +<meta name="generator" content="cow"> |
| 8 | +<title>The GATE Cloud Paralleliser (GCP)</title> |
| 9 | +<style type="text/css"> |
| 10 | +/*<![CDATA[*/ |
| 11 | +<!-- |
| 12 | +html, body { |
| 13 | + background: #fff; |
| 14 | + color: #000; |
| 15 | + font-family: sans-serif; |
| 16 | +} |
| 17 | +h1,h2,h3,h4,h5,p,ul,ol { font-family: sans-serif; } |
| 18 | +pre, pre ol, pre ul { font-family: monospace; } |
| 19 | +h3.navhead { |
| 20 | + font-size: 100%; |
| 21 | +} |
| 22 | +div.banner { |
| 23 | + border: none; |
| 24 | + margin-right: 0px; |
| 25 | + margin-left: 0px; |
| 26 | + padding: 0.09em; |
| 27 | + text-align: center; |
| 28 | + font-weight: bold; |
| 29 | +} |
| 30 | +div.banner a:link, div.banner { |
| 31 | + background: #A0D0F0; |
| 32 | + color: #000000; |
| 33 | +} |
| 34 | +div.banner a:active { |
| 35 | + background: #000000; |
| 36 | + color: #FFFFFF; |
| 37 | +} |
| 38 | +div.banner a:hover { |
| 39 | + background: #000000; |
| 40 | + color: #FFFFFF; }
| 41 | +--> |
| 42 | +/*]]>*/ |
| 43 | +</style> |
| 44 | +</head> |
| 45 | +<body bgcolor="#FFFFFF" text="#000000"> |
| 46 | +<h1 class="cow-title-heading">The GATE Cloud Paralleliser (GCP)</h1> |
| 47 | + |
| 48 | +<p>GCP is a tool for running pipelines built with GATE Developer over large
| 49 | +collections of thousands to millions of documents, using a multi-threaded
| 50 | +architecture to make the best use of today's multi-core processors.</p>
| 51 | + |
| 52 | +<p>GCP tasks, or <em>batches</em>, are defined using an extensible XML syntax that describes
| 53 | +the location and format of the input files, the GATE application to be run, and |
| 54 | +the kinds of outputs required. A number of standard input and output handlers |
| 55 | +are provided, but all the various components are pluggable so custom |
| 56 | +implementations can be used if the task requires it. GCP keeps track of the |
| 57 | +progress of each batch in a human- and machine-readable XML format, and is |
| 58 | +designed so that if a running batch is interrupted for any reason it can be |
| 59 | +re-run with the same settings and GCP will automatically continue from where it |
| 60 | +left off.</p> |
| 61 | + |
| 62 | +<p>A users' manual is available here: <a class="cow-url" href="doc/gcp-guide.pdf">PDF</a>.</p> |
| 63 | +</body></html> |