References
parsers
summary | ||
public |
Parses a WARC file automatically detecting if it is gzipped. |
|
public |
Utility class that provides the means to detect if a file is gzipped or not |
|
public |
Parse a WARC.gz file |
|
public |
Parse a WARC file |
|
public |
Transforms a WARC file ReadStream into its individual WARCRecords |
|
public |
F async * recordIterator(warcStream: ReadStream | Gunzip): AsyncIterator<WARCRecord> Creates async iterator that yields WARCRecords given a readable stream of a WARC file |
requestCapturers
summary | ||
public |
Represents A Unique Request And Response Chain As Made By A Page Consolidates The Modification Of HTTP/2 Into HTTP/1.1 Provides Utility Functionality For Serialization To WARC |
|
public |
Represents a request and response |
|
public |
chrome-remote-interface-extra request capturer |
|
public |
The remote electron request chapturer to use along side ElectronWARCGenerator See the documentation for attach and maybeNetworkMessage for setup information Controlled via startCapturing and stopCapturing |
|
public |
Puppeteer Request Capturer |
|
public |
|
|
public |
The remote chrome request chapturer to use along side RemoteChromeWARCGenerator The only setup required is to pass the chrome-remote-interface Network object Controlled via startCapturing and stopCapturing |
|
public |
Class Responsible For Managing The Captured Request / Responses For A Page |
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
utils
summary | ||
public |
Electron debugger errors passed to the callback are plain objects not Errors but contain information concerning the error. |
|
public |
F ensureWARCFilename(filename: string, gzipping: boolean): string Ensure that the supplied name for the WARC is correct |
|
public |
F getResBodyElectron(requestId: string, wcDebugger: Object): Promise<Buffer> Electron requires the use of the debugger to retrieve the requests response body |
|
public |
F stringifyHeaders(headers: Object): string Converts an HTTP headers object into its string representation |
|
public |
F stringifyRequestHeaders(headers: Object, host: string): string Converts an HTTP request headers object into its string representation |
|
public |
F isEmptyPlainObject(object: Object): boolean Test to see if a |
|
public |
V ElectronGetResError: * |
|
public |
V constants: {canReplayProtocols: Set<string>, SPACE: string, H2Method: string, HTTP1D1: string, DASH: string, H2path: string} |
|
public |
V ensureWARCFilename: * |
|
public |
V getResBodyElectron: * |
|
public |
V isEmptyPlainObject: * |
|
public |
V stringifyHeaders: * |
|
public |
|
warcRecord
summary | ||
public |
Progressively builds warc records by consuming the file line by line |
|
public |
WARC record class. |
|
public |
Utility class for parsing parts of WARC records |
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
writers
summary | ||
public |
WARC Generator for use with puppeteer |
|
public |
WARC Generator for use with Electron |
|
public |
WARC Generator for use with puppeteer |
|
public |
WARC Generator for use with puppeteer |
|
public |
WARC Generator for use with chrome-remote-interface |
|
public |
WARC Generator for use with request |
|
public |
Base class used for writing to the WARC |
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|