Reference page

Custom R Connections

R connections are the standard abstraction for I/O in R -- readLines(), writeLines(), readBin(), writeBin(), scan(), and many other functions all operate on connections. miniextendr lets you create custom R connections backed by Rust types, enabling you to expose any Rust I/O source or sink to R's connection infrastructure.

8 min read 1493 words 70 pages in this section

R connections are the standard abstraction for I/O in R – readLines(), writeLines(), readBin(), writeBin(), scan(), and many other functions all operate on connections. miniextendr lets you create custom R connections backed by Rust types, enabling you to expose any Rust I/O source or sink to R’s connection infrastructure.

Feature flag: connections (opt-in, experimental)

[dependencies]
miniextendr-api = { version = "0.1", features = ["connections"] }

Warning: R explicitly reserves the right to change the connection C API without backward compatibility. This module performs a compile-time ABI version check, but future R releases may break it. See Safety & Version Checking.

🔗Table of Contents

🔗Quick Start

Here is a minimal read-only connection that serves an in-memory string:

use miniextendr_api::connection::{RConnectionImpl, RCustomConnection};
use miniextendr_api::ffi::SEXP;
use miniextendr_api::miniextendr;

struct StringSource {
    data: Vec<u8>,
    pos: usize,
}

impl RConnectionImpl for StringSource {
    fn open(&mut self) -> bool {
        self.pos = 0;
        true
    }

    fn read(&mut self, buf: &mut [u8]) -> usize {
        let remaining = self.data.len().saturating_sub(self.pos);
        let n = buf.len().min(remaining);
        buf[..n].copy_from_slice(&self.data[self.pos..self.pos + n]);
        self.pos += n;
        n
    }
}

#[miniextendr]
pub fn my_string_connection(text: &str) -> SEXP {
    RCustomConnection::new()
        .description("string source")
        .mode("r")
        .class_name("stringSource")
        .can_read(true)
        .text(true)
        .build(StringSource {
            data: text.as_bytes().to_vec(),
            pos: 0,
        })
}

From R:

conn <- my_string_connection("hello\nworld")
readLines(conn)
#> [1] "hello" "world"
close(conn)

🔗The RConnectionImpl Trait

Implement RConnectionImpl on your type to define connection behavior. All methods have sensible defaults – you only override what you need.

pub trait RConnectionImpl: Sized + 'static {
    fn open(&mut self) -> bool { true }
    fn close(&mut self) {}
    fn destroy(&mut self) {}
    fn read(&mut self, buf: &mut [u8]) -> usize { 0 }
    fn write(&mut self, buf: &[u8]) -> usize { 0 }
    fn fgetc(&mut self) -> i32 { /* reads one byte via read() */ }
    fn seek(&mut self, where_: f64, origin: i32, rw: i32) -> f64 { -1.0 }
    fn truncate(&mut self) {}
    fn flush(&mut self) -> i32 { 0 }
    fn vfprintf(&mut self, fmt: *const c_char, ap: *mut c_void) -> i32 { -1 }
}

🔗Method Reference

Method	When Called	Return Value	Notes
`open`	`open(conn)` in R or auto-open on first use	`true` = success, `false` = failure
`close`	`close(conn)` in R	(none)	Called before `destroy`
`destroy`	Connection is garbage collected	(none)	Your type is dropped automatically after this
`read`	`readBin()`, `readLines()`, etc.	Number of bytes read (0 = EOF)
`write`	`writeBin()`, `writeLines()`, etc.	Number of bytes written
`fgetc`	Character-by-character reading	Byte as `i32`, or -1 on EOF	Default delegates to `read`
`seek`	`seek(conn, where, origin)`	New position, or -1 on failure	See origin codes below
`truncate`	`truncate(conn)`	(none)
`flush`	`flush(conn)`	0 = success, non-zero = failure
`vfprintf`	Formatted print (rarely needed)	Characters written, or -1	R typically uses `write`

🔗Seek Origin Codes

Value	Meaning	R Equivalent
1	From start	`"start"`
2	From current position	`"current"`
3	From end	`"end"`

When where_ is NaN, the caller is querying the current position – return it without moving.

🔗Memory Management

Your type is Box-ed and stored in the connection’s private field. When the connection is garbage collected, destroy() is called first, then your type is automatically dropped. You do not need to free memory manually.

🔗Builder Pattern

RCustomConnection configures and creates the R connection object:

let conn_sexp = RCustomConnection::new()
    .description("my data source")   // Shown in summary(conn)
    .mode("rb")                      // Open mode (see table below)
    .class_name("myConnection")      // R class (along with "connection")
    .text(false)                     // Binary mode
    .can_read(true)                  // Supports reading
    .can_write(false)                // Does not support writing
    .can_seek(true)                  // Supports seeking
    .blocking(true)                  // Blocking I/O (default)
    .build(my_state);                // Consumes the state

🔗Builder Methods

Method	Default	Description
`description(s)`	`"custom connection"`	Description shown in `summary()`
`mode(s)`	`"r"`	Open mode string
`class_name(s)`	`"customConnection"`	First element of the R class vector
`text(bool)`	Inferred from mode	Text vs binary mode
`can_read(bool)`	Inferred from mode	Whether reading is supported
`can_write(bool)`	Inferred from mode	Whether writing is supported
`can_seek(bool)`	`false`	Whether seeking is supported
`blocking(bool)`	`true`	Whether I/O is blocking

🔗Mode Strings

Mode	Meaning
`"r"`	Read text
`"w"`	Write text
`"a"`	Append text
`"rb"`	Read binary
`"wb"`	Write binary
`"ab"`	Append binary
`"r+"`	Read/write text
`"r+b"` or `"rb+"`	Read/write binary

The mode string must be at most 4 characters.

🔗Return Value

build() returns a SEXP – an R connection object that can be returned directly from a #[miniextendr] function.

🔗Connection Lifecycle

build()
  |
  v
[Connection created in CLOSED state]
  |
  v
open() callback  <-- triggered by open(conn) or auto-open
  |
  v
[OPEN state]
  |  read() / write() / seek() / flush() / fgetc()
  v
close() callback  <-- triggered by close(conn) or auto-close
  |
  v
[CLOSED state]
  |
  v
destroy() callback  <-- triggered by garbage collection
  |
  v
[Rust type dropped (Box<T> freed)]

Key points:

R_new_custom_connection creates connections in the CLOSED state. R opens them automatically on first use.
close() is called before destroy(). Your close handler should release resources; destroy is for final cleanup.
destroy() is always called, even if the connection was never opened.
Your Rust type is dropped after destroy() returns, even if destroy() panics.

🔗std::io Adapters

If your type already implements std::io::Read, std::io::Write, or std::io::Seek, you can skip implementing RConnectionImpl entirely. The RConnectionIo builder wraps any std::io type automatically:

use miniextendr_api::connection::RConnectionIo;
use std::io::Cursor;

#[miniextendr]
pub fn cursor_connection(data: Vec<u8>) -> SEXP {
    RConnectionIo::new(Cursor::new(data))
        .description("Rust Cursor")
        .mode("r+b")
        .build_read_write_seek()
}

🔗Available Build Methods

Choose the build method matching your type’s trait bounds:

Method	Requires	Capabilities
`build_read()`	`Read`	Read-only
`build_write()`	`Write`	Write-only
`build_read_write()`	`Read + Write`	Read and write
`build_read_seek()`	`Read + Seek`	Read with seeking
`build_write_seek()`	`Write + Seek`	Write with seeking
`build_read_write_seek()`	`Read + Write + Seek`	Full capabilities
`build_bufread()`	`BufRead`	Buffered read (optimized `fgetc`)

Capabilities (can_read, can_write, can_seek) are auto-detected from the adapter type. You can override them:

RConnectionIo::new(my_reader)
    .can_write(false)  // Override auto-detection
    .build_read()

🔗Adapter Types

The adapters are also available directly if you need finer control:

Adapter	Wraps	Capabilities
`IoRead<T>`	`Read`	Read
`IoWrite<T>`	`Write`	Write, flush
`IoReadWrite<T>`	`Read + Write`	Read, write, flush
`IoReadSeek<T>`	`Read + Seek`	Read, seek
`IoWriteSeek<T>`	`Write + Seek`	Write, seek, flush
`IoReadWriteSeek<T>`	`Read + Write + Seek`	Read, write, seek, flush
`IoBufRead<T>`	`BufRead`	Read, optimized fgetc

use miniextendr_api::connection::{IoRead, RCustomConnection};

let adapter = IoRead::new(my_reader);
let conn = RCustomConnection::new()
    .description("adapted reader")
    .mode("rb")
    .can_read(true)
    .build(adapter);

🔗Safety & Version Checking

R’s connection C API is explicitly unstable. From R’s R_ext/Connections.h:

“We explicitly reserve the right to change the connection implementation without a compatibility layer.”

miniextendr mitigates this with two layers of defense:

🔗Compile-Time ABI Check

pub const EXPECTED_CONNECTIONS_VERSION: c_int = 1;

pub fn check_connections_version() {
    assert_eq!(
        R_CONNECTIONS_VERSION, EXPECTED_CONNECTIONS_VERSION,
        "R_CONNECTIONS_VERSION mismatch"
    );
}

This is called automatically by RCustomConnection::build(). If the R headers used during compilation have a different R_CONNECTIONS_VERSION, the assertion fails at compile time (both values are const), catching ABI mismatches before any unsafe code runs.

🔗Runtime Version Check

For additional safety, check_connections_runtime() probes the running R version:

use miniextendr_api::connection::check_connections_runtime;

unsafe {
    match check_connections_runtime() {
        Ok(()) => { /* R >= 4.3.0, connections supported */ }
        Err(msg) => panic!("Connections not available: {msg}"),
    }
}

This calls R.Version() at runtime and verifies R >= 4.3.0 (when R_CONNECTIONS_VERSION = 1 was stabilized). Use this in package initialization or before creating your first connection to fail early with a clear error message.

The Rconn struct layout (#[repr(C)]) mirrors R’s struct Rconn exactly. Any field reordering or resizing in a future R version would cause memory corruption. The version checks are the first line of defense.

🔗Capability Probing

You can query the capabilities and state of a connection at runtime using ConnectionCapabilities:

use miniextendr_api::connection::ConnectionCapabilities;

unsafe {
    let caps = ConnectionCapabilities::from_sexp(conn_sexp);
    println!("read={}, write={}, seek={}, text={}, open={}, blocking={}",
        caps.can_read, caps.can_write, caps.can_seek,
        caps.is_text, caps.is_open, caps.is_blocking);
}

🔗Fields

Field	Type	Description
`can_read`	`bool`	Connection supports reading
`can_write`	`bool`	Connection supports writing
`can_seek`	`bool`	Connection supports seeking
`is_text`	`bool`	Text mode (`true`) vs binary mode (`false`)
`is_open`	`bool`	Connection is currently open
`is_blocking`	`bool`	Connection uses blocking I/O

🔗Construction

Method	Input	Notes
`from_sexp(conn_sexp)`	R connection SEXP	Calls `R_GetConnection` internally
`from_handle(handle)`	`Rconnection` handle	When you already have the handle

🔗Binary Mode Helper

use miniextendr_api::connection::{is_binary_mode, connection_mode};

unsafe {
    if is_binary_mode(conn_sexp) {
        // Binary mode: "rb", "wb", "r+b", etc.
    }

    let mode = connection_mode(conn_sexp);  // e.g., "rb+"
}

is_binary_mode() checks if the connection’s mode string contains 'b'. connection_mode() returns the full mode string.

🔗Connection Description

use miniextendr_api::connection::connection_description;

unsafe {
    let desc = connection_description(conn_sexp); // e.g., "my data source"
}

🔗Error Handling

🔗Panic Safety

All callback trampolines are wrapped in catch_connection_panic, which catches Rust panics and returns a safe fallback value:

Callback	Fallback on Panic
`open`	`FALSE` (open failed)
`close`	(no-op)
`destroy`	(still drops Box, always)
`read`	0 (EOF)
`write`	0 (no bytes written)
`fgetc`	-1 (EOF)
`seek`	-1.0 (seek failed)
`truncate`	(no-op)
`flush`	-1 (failure)
`vfprintf`	-1 (failure)

Panics are caught, telemetry is fired via panic_telemetry::fire(), and R receives a non-fatal error indicator. The connection remains in a consistent state.

🔗Destroy Always Runs

The destroy trampoline has special handling: even if destroy() panics, the Box<T> is still freed and the private pointer is set to null:

// Always drop the boxed state, even if destroy() panicked
let _ = unsafe { Box::from_raw(private as *mut T) };
(*conn).private = std::ptr::null_mut();

🔗std::io Error Handling

The adapter types convert std::io::Error to connection-friendly values:

Read errors return 0 (EOF)
Write errors return 0 (no bytes written)
Flush errors return -1 (failure)
Seek errors return -1.0 (failure)

🔗Trampoline Architecture

R’s connection system expects C function pointers for each callback. miniextendr bridges these to Rust trait methods using monomorphized trampolines:

R calls C function pointer
  |
  v
trampoline<T>(conn: *mut Rconn) -> ReturnType
  |
  +--> catch_connection_panic(fallback, || {
  |      let state = get_state::<T>(conn);  // Extract &mut T from conn.private
  |      state.method(args)                  // Call trait method
  |    })
  |
  v
Return value (or fallback on panic)

Each trampoline is a generic unsafe extern "C-unwind" fn parameterized by T: RConnectionImpl. When you call build::<T>(state), the compiler generates concrete function pointers for your specific type. The C-unwind ABI allows panics to propagate up to the catch_connection_panic boundary.

The private field of the Rconn struct stores a Box::into_raw(Box::new(state)) pointer. Trampolines cast this back to &mut T via get_state().

🔗Complete Examples

🔗In-Memory Read/Write Buffer

A buffer that supports reading, writing, and seeking – similar to textConnection() but in binary mode:

use miniextendr_api::connection::{RConnectionImpl, RCustomConnection};
use miniextendr_api::ffi::SEXP;
use miniextendr_api::miniextendr;

struct MemoryBuffer {
    data: Vec<u8>,
    position: usize,
}

impl RConnectionImpl for MemoryBuffer {
    fn open(&mut self) -> bool {
        self.position = 0;
        true
    }

    fn read(&mut self, buf: &mut [u8]) -> usize {
        let available = self.data.len().saturating_sub(self.position);
        let to_read = buf.len().min(available);
        if to_read > 0 {
            buf[..to_read].copy_from_slice(
                &self.data[self.position..self.position + to_read]
            );
            self.position += to_read;
        }
        to_read
    }

    fn write(&mut self, buf: &[u8]) -> usize {
        let end_pos = self.position + buf.len();
        if end_pos > self.data.len() {
            self.data.resize(end_pos, 0);
        }
        self.data[self.position..end_pos].copy_from_slice(buf);
        self.position = end_pos;
        buf.len()
    }

    fn seek(&mut self, where_: f64, origin: i32, _rw: i32) -> f64 {
        if where_.is_nan() {
            return self.position as f64;
        }
        let new_pos = match origin {
            1 => where_.max(0.0) as usize,                        // Start
            2 => (self.position as isize + where_ as isize) as usize, // Current
            3 => (self.data.len() as isize + where_ as isize) as usize, // End
            _ => return -1.0,
        };
        self.position = new_pos.min(self.data.len());
        self.position as f64
    }

    fn flush(&mut self) -> i32 { 0 }
}

#[miniextendr]
pub fn memory_connection() -> SEXP {
    RCustomConnection::new()
        .description("memory buffer")
        .mode("r+b")
        .class_name("memoryConnection")
        .can_read(true)
        .can_write(true)
        .can_seek(true)
        .text(false)
        .build(MemoryBuffer { data: Vec::new(), position: 0 })
}

conn <- memory_connection()
writeBin(charToRaw("Hello, World!"), conn)
seek(conn, 0)
rawToChar(readBin(conn, "raw", 13))
#> [1] "Hello, World!"
close(conn)

🔗Streaming Generator

A read-only connection that generates data on-the-fly:

struct CounterConnection {
    current: i32,
    max: i32,
    buffer: Vec<u8>,
    buffer_pos: usize,
}

impl RConnectionImpl for CounterConnection {
    fn read(&mut self, buf: &mut [u8]) -> usize {
        // Refill internal buffer when exhausted
        if self.buffer_pos >= self.buffer.len() {
            if self.current > self.max {
                return 0; // EOF
            }
            self.buffer = format!("{}\n", self.current).into_bytes();
            self.buffer_pos = 0;
            self.current += 1;
        }

        let available = self.buffer.len() - self.buffer_pos;
        let n = buf.len().min(available);
        buf[..n].copy_from_slice(&self.buffer[self.buffer_pos..self.buffer_pos + n]);
        self.buffer_pos += n;
        n
    }
}

#[miniextendr]
pub fn counter_connection(start: i32, end: i32) -> SEXP {
    RCustomConnection::new()
        .description(&format!("counter {}:{}", start, end))
        .mode("r")
        .class_name("counterConnection")
        .can_read(true)
        .text(true)
        .build(CounterConnection {
            current: start, max: end,
            buffer: Vec::new(), buffer_pos: 0,
        })
}

conn <- counter_connection(1L, 5L)
readLines(conn)
#> [1] "1" "2" "3" "4" "5"
close(conn)

🔗std::io Cursor Adapter

Wrap Rust’s std::io::Cursor with zero boilerplate:

use miniextendr_api::connection::RConnectionIo;
use std::io::Cursor;

#[miniextendr]
pub fn cursor_connection(data: Vec<u8>) -> SEXP {
    RConnectionIo::new(Cursor::new(data))
        .description("Rust Cursor")
        .mode("r+b")
        .class_name("cursorConnection")
        .build_read_write_seek()
}

🔗Transform Connection

Apply a byte-level transformation while reading:

struct TransformConnection<F: Fn(u8) -> u8 + 'static> {
    data: Vec<u8>,
    position: usize,
    transform: F,
}

impl<F: Fn(u8) -> u8 + 'static> RConnectionImpl for TransformConnection<F> {
    fn read(&mut self, buf: &mut [u8]) -> usize {
        let available = self.data.len().saturating_sub(self.position);
        let n = buf.len().min(available);
        for i in 0..n {
            buf[i] = (self.transform)(self.data[self.position + i]);
        }
        self.position += n;
        n
    }
}

#[miniextendr]
pub fn uppercase_connection(text: &str) -> SEXP {
    RCustomConnection::new()
        .description("uppercase transform")
        .mode("r")
        .class_name("uppercaseConnection")
        .can_read(true)
        .text(true)
        .build(TransformConnection {
            data: text.as_bytes().to_vec(),
            position: 0,
            transform: |b| if b.is_ascii_lowercase() { b.to_ascii_uppercase() } else { b },
        })
}

🔗Helper Functions

For working with existing R connections from Rust (e.g., when a user passes a connection as an argument):

use miniextendr_api::connection::{get_connection, read_connection, write_connection};

// Get a connection handle from an R SEXP
let handle = unsafe { get_connection(conn_sexp) };

// Read from it
let mut buf = [0u8; 1024];
let n = unsafe { read_connection(handle, &mut buf) };

// Write to it
let data = b"output data\n";
let written = unsafe { write_connection(handle, data) };

These are thin wrappers around R_GetConnection, R_ReadConnection, and R_WriteConnection.

🔗Limitations

Experimental API. R may change the connection ABI in any release. The compile-time version check catches this, but you may need to update miniextendr when upgrading R.
No vfprintf by default. The vfprintf callback receives raw C varargs (va_list), which are not portable in Rust. The default returns -1. R rarely calls this – it prefers the write callback.
Not Send/Sync. Connections run on the main R thread. Your RConnectionImpl type does not need to be thread-safe.
GC protection. The SEXP returned by build() must be protected from R’s garbage collector if you store it. Returning it directly from a #[miniextendr] function handles this automatically.
No stat support. R’s connection C API does not include a stat callback (file size, modification time, etc.). If you need stat-like information, track it in your RConnectionImpl type or query it through a separate #[miniextendr] function.

🔗See Also

FEATURES.md – Feature flags reference (connections)
THREADS.md – Thread safety and the worker thread model
ERROR_HANDLING.md – Panic handling across the FFI boundary
Test fixtures: rpkg/src/rust/connection_tests.rs