References

parsers

summary
public

Parses a WARC file, automatically detecting whether it is gzipped.

public

Utility class that provides the means to detect whether a file is gzipped
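
Gzip detection is commonly done by inspecting the first two bytes of the data, which for a gzip stream are always 0x1f 0x8b (RFC 1952). The following is a minimal sketch of that check, not the library's implementation; `looksGzipped` is a hypothetical helper name introduced for illustration.

```javascript
const zlib = require('zlib')

// gzip streams begin with the two magic bytes 0x1f 0x8b (RFC 1952)
const GZIP_MAGIC = [0x1f, 0x8b]

// Hypothetical helper: true when the buffer starts with the gzip magic bytes
function looksGzipped (buf) {
  return buf.length >= 2 && buf[0] === GZIP_MAGIC[0] && buf[1] === GZIP_MAGIC[1]
}

const plain = Buffer.from('WARC/1.0\r\n')
const gzipped = zlib.gzipSync(plain)

console.log(looksGzipped(plain))   // false
console.log(looksGzipped(gzipped)) // true
```

For a file on disk, only the first two bytes need to be read before choosing between a plain and a gzip-aware parser.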

public

Parse a WARC.gz file

public

Parse a WARC file

public

Transforms a WARC file ReadStream into its individual WARCRecords

public

Creates an async iterator that yields WARCRecords given a readable stream of a WARC file
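
To illustrate the general idea of an async iterator over WARC records, here is a simplified sketch that uses each record's Content-Length header to find record boundaries. It is an assumption-laden illustration, not the library's parser: `iterateWarcRecords` is a hypothetical name, the yielded objects are plain `{ header, content }` pairs rather than WARCRecords, and a final record without its trailing blank lines is not emitted.

```javascript
const { Readable } = require('stream')

const CRLF2 = Buffer.from('\r\n\r\n')

// Hypothetical sketch: yields { header, content } for each complete record
async function * iterateWarcRecords (readable) {
  let pending = Buffer.alloc(0)
  for await (const chunk of readable) {
    pending = Buffer.concat([pending, Buffer.from(chunk)])
    while (true) {
      // the WARC header block ends at the first blank line
      const headerEnd = pending.indexOf(CRLF2)
      if (headerEnd === -1) break
      const header = pending.slice(0, headerEnd).toString('utf8')
      const match = /^Content-Length:\s*(\d+)\s*$/im.exec(header)
      if (!match) throw new Error('WARC record is missing Content-Length')
      const length = Number(match[1])
      const bodyStart = headerEnd + CRLF2.length
      // each record is followed by two CRLFs
      const recordEnd = bodyStart + length + CRLF2.length
      if (pending.length < recordEnd) break // wait for more data
      yield { header, content: pending.slice(bodyStart, bodyStart + length) }
      pending = pending.slice(recordEnd)
    }
  }
}

// Usage: the chunk boundary deliberately falls in the middle of a record
const makeRecord = body =>
  `WARC/1.0\r\nWARC-Type: resource\r\nContent-Length: ${Buffer.byteLength(body)}\r\n\r\n${body}\r\n\r\n`
const data = Buffer.from(makeRecord('hello') + makeRecord('warc!'))

async function demo () {
  const source = Readable.from([data.slice(0, 30), data.slice(30)])
  for await (const record of iterateWarcRecords(source)) {
    console.log(record.content.toString())
  }
}
demo()
```

Buffering until a full record is available keeps the consumer code a plain `for await` loop, at the cost of holding one whole record in memory.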

requestCapturers

summary
public

Represents a unique request and response chain as made by a page. Consolidates the modification of HTTP/2 into HTTP/1.1. Provides utility functionality for serialization to WARC.
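
One reading of the HTTP/2 to HTTP/1.1 consolidation is that the HTTP/2 pseudo-headers (`:method`, `:path`, `:authority`) are folded into an HTTP/1.1 request line and Host header before serialization. The sketch below illustrates that idea only; `h2ToHttp11RequestHead` is a hypothetical function and is not taken from the library.

```javascript
// Hypothetical sketch: build an HTTP/1.1 request head from HTTP/2 style headers
function h2ToHttp11RequestHead (h2Headers, fallbackMethod = 'GET') {
  const method = h2Headers[':method'] || fallbackMethod
  const path = h2Headers[':path'] || '/'
  const lines = [`${method} ${path} HTTP/1.1`]
  // :authority plays the role of the HTTP/1.1 Host header
  if (h2Headers[':authority'] && !h2Headers.host) {
    lines.push(`host: ${h2Headers[':authority']}`)
  }
  for (const [name, value] of Object.entries(h2Headers)) {
    if (name.startsWith(':')) continue // pseudo-headers are not valid in HTTP/1.1
    lines.push(`${name}: ${value}`)
  }
  return lines.join('\r\n') + '\r\n\r\n'
}

const head = h2ToHttp11RequestHead({
  ':method': 'GET',
  ':path': '/index.html',
  ':authority': 'example.com',
  accept: '*/*'
})
console.log(JSON.stringify(head))
```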

public

Represents a request and response

public

chrome-remote-interface-extra request capturer

public

The remote Electron request capturer to use alongside ElectronWARCGenerator. See the documentation for attach and maybeNetworkMessage for setup information. Controlled via startCapturing and stopCapturing.

public

Puppeteer Request Capturer

public

The remote Chrome request capturer to use alongside RemoteChromeWARCGenerator. The only setup required is to pass the chrome-remote-interface Network object. Controlled via startCapturing and stopCapturing.

public

Class responsible for managing the captured requests and responses for a page

utils

summary
public

Electron debugger errors passed to the callback are plain objects, not Errors, but they contain information about the error.

public

F ensureWARCFilename(filename: string, gzipping: boolean): string

Ensure that the supplied name for the WARC is correct
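
As a rough illustration of what such a check might involve, the sketch below normalizes a name so that it ends in `.warc` or `.warc.gz` depending on the `gzipping` flag. This assumes "correct" refers to the file extension; `ensureWarcName` is a hypothetical stand-in and may differ from what `ensureWARCFilename` actually does.

```javascript
// Hypothetical sketch: make the extension agree with the gzipping flag
function ensureWarcName (filename, gzipping) {
  // drop an existing .warc or .warc.gz suffix, if any
  const base = filename.replace(/\.warc(\.gz)?$/i, '')
  return gzipping ? `${base}.warc.gz` : `${base}.warc`
}

console.log(ensureWarcName('crawl', true))          // crawl.warc.gz
console.log(ensureWarcName('crawl.warc', true))     // crawl.warc.gz
console.log(ensureWarcName('crawl.warc.gz', false)) // crawl.warc
```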

public

F getResBodyElectron(requestId: string, wcDebugger: Object): Promise<Buffer>

Electron requires the use of the debugger to retrieve a request's response body

public

Converts an HTTP headers object into its string representation
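
The string form of HTTP headers is one `name: value` pair per line, each terminated by CRLF. A minimal sketch of such a conversion follows; `headersToString` is a hypothetical name, and treating array values as repeated header lines is an assumption of this example rather than documented behavior.

```javascript
// Hypothetical sketch: serialize a headers object as CRLF-terminated lines
function headersToString (headers) {
  let out = ''
  for (const [name, value] of Object.entries(headers)) {
    // assumption: an array value means the header appears multiple times
    const values = Array.isArray(value) ? value : [value]
    for (const v of values) {
      out += `${name}: ${v}\r\n`
    }
  }
  return out
}

const serialized = headersToString({
  'content-type': 'text/html',
  'set-cookie': ['a=1', 'b=2']
})
console.log(JSON.stringify(serialized))
```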

public

Converts an HTTP request headers object into its string representation

public

Test to see if a plain object is empty

public

V constants: {canReplayProtocols: Set<string>, SPACE: string, H2Method: string, HTTP1D1: string, DASH: string, H2path: string}

warcRecord

summary
public

Progressively builds WARC records by consuming the file line by line

public

WARC record class.

public

Utility class for parsing parts of WARC records
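
To give a sense of what parsing part of a WARC record can look like, the sketch below splits a WARC header block into its version line and named fields. It is an illustrative assumption rather than the utility class itself; `parseWarcHeaderBlock` and the shape of its return value are hypothetical.

```javascript
// Hypothetical sketch: parse a WARC header block into a version and a field map
function parseWarcHeaderBlock (block) {
  const [version, ...fieldLines] = block
    .split('\r\n')
    .filter(line => line.length > 0)
  const fields = {}
  for (const line of fieldLines) {
    const sep = line.indexOf(':') // only the first colon separates name from value
    if (sep === -1) continue
    fields[line.slice(0, sep).trim()] = line.slice(sep + 1).trim()
  }
  return { version, fields }
}

const parsed = parseWarcHeaderBlock(
  'WARC/1.0\r\n' +
  'WARC-Type: response\r\n' +
  'WARC-Target-URI: http://example.com/\r\n' +
  'Content-Length: 42\r\n'
)
console.log(parsed.version, parsed.fields['WARC-Target-URI'])
```

Splitting on the first colon only matters because values such as URIs and timestamps contain colons themselves.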

writers

summary
public

WARC Generator for use with puppeteer

public

WARC Generator for use with Electron

public

WARC Generator for use with puppeteer

public

WARC Generator for use with puppeteer

public

WARC Generator for use with chrome-remote-interface

public

WARC Generator for use with request

public

Base class used for writing to the WARC