Crawler
Automatically analyze your product with the Pointer CLI to build comprehensive knowledge.
The Pointer Crawler enables you to automatically gather and analyze content from your product, creating a comprehensive knowledge base for AI-powered features.
Prerequisites
- Node.js version 16 or higher
- Access to Pointer dashboard
Installation
Install the Pointer CLI globally using npm:
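For example, a global install might look like the following; the package name `@pointer/cli` is an assumption for illustration, so use the name given in the official install instructions if it differs:

```bash
# Package name assumed for illustration
npm install -g @pointer/cli
```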
Verify the installation:
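For example, printing the CLI version (the `--version` flag is documented under Global options below) confirms the binary is on your PATH:

```bash
pointer --version
```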
Authentication
Create an API key
Navigate to API Keys
Go to your Keys settings in the Pointer dashboard.
Generate new key
Click Create new key and provide:
- Name: Descriptive identifier (e.g., “CLI Production”)
- Description: Optional context about key usage
- Expiration: Optional expiry date (defaults to never expire)
Copy your secret key
Save the generated key immediately - it won’t be shown again. Keys follow the format:
Configure authentication
Set your secret key using one of these methods:
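For example, export the key as an environment variable or pass it per command with the documented `-s, --secret-key` option (the key value is a placeholder):

```bash
# Option 1: environment variable (recommended)
export POINTER_SECRET_KEY="your-secret-key"

# Option 2: per-command flag (may be recorded in shell history)
pointer scrape --secret-key "your-secret-key"
```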
Environment variables are recommended for security. Command-line options may expose keys in shell history.
Core workflow
Step 1: Initialize your website
Start by adding your website to the crawler configuration:
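For example:

```bash
pointer init
```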
The interactive prompt will guide you through:
- Entering a friendly name for identification
- Providing your website URL
- Confirming the configuration
Step 2: Scrape your content
Begin the automated content collection:
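For example:

```bash
pointer scrape
```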
Choose from interactive options:
- Scraping mode: Headless (fast) or Browser (with authentication)
- Crawl depth: Fast (surface content) or Deep (interactive elements)
- PII protection: Configure sensitivity and redaction settings
The CLI saves your progress automatically. If interrupted, it will offer to resume from where it left off.
Step 3: Upload for analysis
Send your scraped content to Pointer for processing:
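For example:

```bash
pointer upload
```

You can check processing progress afterwards with `pointer status`.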
The CLI will:
- Display a summary of collected data
- Confirm the upload scope
- Transfer content to your knowledge base
Command reference
Primary commands
Command | Description | Authentication |
---|---|---|
pointer init | Add a website to crawl | Required |
pointer scrape | Collect content from configured websites | Required |
pointer upload | Transfer scraped data to Pointer | Required |
pointer status | Check crawl processing status | Required |
pointer list | View local scraped data | Not required |
pointer cleanup | Remove all local data | Not required |
pointer purge | Delete server-side crawl data | Required |
Global options
Available for all commands:
Option | Description |
---|---|
-s, --secret-key <key> | API secret key (overrides environment variable) |
-v, --version | Display CLI version |
--help | Show command help |
Scraping options
Configure `pointer scrape` behavior:
Option | Description | Default |
---|---|---|
--max-pages <number> | Maximum pages to crawl | 200 |
--concurrency <number> | Parallel page processing | 1 |
--fast | Use fast crawl mode | Interactive prompt |
--no-pii-protection | Disable PII detection | PII protection enabled |
--pii-sensitivity <level> | Set detection level (low/medium/high) | Interactive prompt |
--exclude-routes <patterns> | Comma-separated routes to exclude | None |
--include-routes <patterns> | Comma-separated routes to include (whitelist mode) | None |
--bearer-token <token> | Bearer token for API authentication | None |
--headers <json> | Custom headers as JSON string | None |
--cookies-file <path> | Path to cookies JSON file | None |
--log-level <level> | Logging verbosity | info |
Excluding routes
The `--exclude-routes` flag allows you to specify routes that should be excluded from scraping. This is useful for avoiding admin panels, API endpoints, or specific file types (see the example below).
Pattern types:
- Exact match: `/admin` excludes only the exact path
- Wildcard patterns:
  - `/admin/*` excludes all paths starting with `/admin/`
  - `*.pdf` excludes all PDF files
  - `/api/*/docs` excludes paths like `/api/v1/docs` and `/api/v2/docs`
The exclusion check is performed on the URL path only (not the full URL). Patterns are case-sensitive, and the start URL cannot be excluded.
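A sketch of how these patterns look on the command line (the routes are illustrative):

```bash
pointer scrape --exclude-routes "/admin,/admin/*,*.pdf,/api/*/docs"
```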
Including routes (whitelist mode)
The `--include-routes` flag allows you to specify which routes should be included in scraping. When used, only matching routes will be scraped (see the example below).
Include vs exclude logic:
- If `--include-routes` is specified, a URL must match at least one include pattern to be scraped
- If both `--include-routes` and `--exclude-routes` are specified:
  - The URL must match an include pattern
  - The URL must NOT match any exclude pattern
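For example, to limit a crawl to documentation pages while still excluding one subsection (the paths are illustrative):

```bash
pointer scrape --include-routes "/docs/*" --exclude-routes "/docs/internal/*"
```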
Authentication options
Bearer token authentication
Use for APIs that require bearer token authentication:
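For example:

```bash
pointer scrape --bearer-token sk-proj-abc123xyz789
```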
This adds the header: `Authorization: Bearer sk-proj-abc123xyz789`
Custom headers
Add any custom headers required by the target website:
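For example, passing headers as a JSON string (the header names and values are illustrative):

```bash
pointer scrape --headers '{"X-API-Key": "my-api-key", "Accept-Language": "en-US"}'
```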
Cookies file
Load cookies from a JSON file for session-based authentication:
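For example (the file path is illustrative):

```bash
pointer scrape --cookies-file ./cookies.json
```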
The cookies file should be in Playwright’s cookie format:
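A minimal sketch of such a file, written here with a shell heredoc; the cookie name, value, and domain are placeholders:

```bash
cat > cookies.json <<'EOF'
[
  {
    "name": "session_id",
    "value": "your-session-cookie-value",
    "domain": "app.example.com",
    "path": "/",
    "expires": 1767225600,
    "httpOnly": true,
    "secure": true,
    "sameSite": "Lax"
  }
]
EOF
```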
Combined examples
Scraping a protected API documentation
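A possible invocation for this case, combining the documented `--bearer-token`, `--headers`, and `--include-routes` flags (all values are illustrative):

```bash
pointer scrape \
  --bearer-token sk-proj-abc123xyz789 \
  --headers '{"X-API-Version": "2024-01"}' \
  --include-routes "/docs/*,/reference/*"
```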
Scraping an e-commerce site with login
- First, save your cookies after manual login (for example, by exporting them from a browser-mode session)
- Then use the saved cookies for subsequent scrapes, as shown in the sketch below
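A sketch of the second step, reusing the exported cookies file in a headless scrape (the path and excluded routes are illustrative):

```bash
pointer scrape --cookies-file ./shop-cookies.json --exclude-routes "/checkout/*,/account/*"
```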
Scraping with multiple authentication methods
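A sketch combining the documented authentication flags in a single invocation (all values are placeholders):

```bash
pointer scrape \
  --bearer-token "$API_TOKEN" \
  --headers '{"X-Tenant-Id": "acme"}' \
  --cookies-file ./cookies.json
```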
Best practices
Access control with test accounts
The crawler will access your website with the same permissions as a regular user. To prevent unauthorized access to sensitive areas:
- Configure your application to block or redirect admin/private routes for the user account you'll be using
- Create a dedicated test user with limited permissions (no admin access)
- Use `--exclude-routes` to prevent scraping of sensitive areas like `/admin/*` or `/api/*`
- Ensure no sensitive information (API keys, passwords, personal data) is exposed in crawlable content, and enable PII protection at high sensitivity
Use interactive mode
Run commands without options for guided workflows:
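For example, the core workflow can be run entirely with bare commands:

```bash
pointer init
pointer scrape
pointer upload
```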
The CLI provides clear prompts and smart defaults for all operations.
Secure your credentials
- Store API keys in environment variables
- Never commit keys to version control
- Set expiration dates for temporary access
Optimize crawling
- Use browser mode only when authentication is required
- Enable PII protection for user-facing applications
- Monitor crawl status before uploading
Manage your data
- Review scraped content with `pointer list` before uploading
- Use `pointer cleanup` to remove local data after successful uploads
- Keep crawl sessions organized with descriptive website names
Advanced scraping tips
- Test with `--log-level debug` to see which URLs are being included/excluded
- Use `--max-pages 10` first to test your patterns before full scraping
- Save cookies from browser mode for reuse in headless scraping
- Combine authentication methods when sites require multiple forms of auth
- Use exact paths in include/exclude patterns when you need precision
Automation examples
While the CLI is designed for interactive use, automation is supported for CI/CD pipelines:
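A sketch of a non-interactive run using only the flags documented above; it assumes `POINTER_SECRET_KEY` is set in the CI environment and that these options cover the prompts your workflow would otherwise see:

```bash
# Collect content with prompts pre-answered via flags
pointer scrape --fast --max-pages 200 --pii-sensitivity high --log-level info

# Upload the scraped data to Pointer
pointer upload
```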
Use automation options carefully. Interactive mode provides safety confirmations and validation that prevent common errors.
Troubleshooting
Authentication errors
If you encounter authentication issues:
- Verify your API key is valid in the dashboard
- Check that the environment variable is set correctly: `echo $POINTER_SECRET_KEY`
- Ensure the key hasn’t expired
- Confirm you have necessary permissions
Crawling interruptions
The crawler automatically saves progress. If interrupted:
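Re-run the scrape and the CLI will offer to resume from the saved progress:

```bash
pointer scrape
```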
Upload limitations
- Maximum 500 pages per upload (API limit)
- Large crawls are automatically truncated
- Use `--max-pages` to control crawl size upfront
Next steps
After successfully crawling and uploading your content:
- View your enriched knowledge base in the Knowledge section
- Configure AI features to leverage the collected data
- Monitor analytics to understand content usage
- Set up regular crawls to keep knowledge current