Skip to main content
IP addresses are unique identifiers for devices communicating over networks. Standing for Internet Protocol, they adhere to standardized formats that enable data routing across the internet. In data collaboration, IP addresses provide signals for geographic targeting, fraud detection, and network-level analytics.

IP address formats

IPv4

The most common format, IPv4 addresses consist of four octets (8-bit numbers) separated by periods. Each octet ranges from 0 to 255.
192.168.1.1
74.125.224.72
Characteristics:
  • 32-bit address space
  • ~4.3 billion possible addresses
  • Running out of available addresses globally

IPv6

Developed to expand the address pool, IPv6 uses a 128-bit address format with eight groups of four hexadecimal digits separated by colons.
2001:0db8:85a3:0000:0000:8a2e:0370:7334
Characteristics:
  • 128-bit address space
  • ~340 undecillion possible addresses
  • Growing adoption but not yet universal
IPv6 addresses can be abbreviated by omitting leading zeros and collapsing consecutive zero groups with ::. The address above could be written as 2001:db8:85a3::8a2e:370:7334.

Types of IP addresses

Public vs. private

TypeDescriptionExample Ranges
PublicGlobally routable, assigned by ISPsMost addresses outside reserved ranges
PrivateUsed within local networks, not routable on internet10.x.x.x, 172.16-31.x.x, 192.168.x.x
Public IP addresses identify devices on the open internet and are the primary type used in data collaboration. Private addresses are used within homes and organizations and are not visible externally.

Static vs. dynamic

TypeDescriptionUse Cases
StaticRemains constant over timeServers, business connections
DynamicChanges periodically, assigned by DHCPResidential connections, mobile
Most consumer internet connections use dynamic IPs, which can change daily or when the router restarts. This affects data collaboration because the same IP address might represent different users over time.

Dedicated vs. shared

TypeDescriptionContext
DedicatedSingle device or organizationDirect server access
SharedMultiple users behind the same IPNAT, VPNs, corporate networks, mobile carriers
Carrier-grade NAT (CGNAT) and VPNs mean that many users may share the same public IP address, making IP-based identification less precise.

IP addresses in data collaboration

Geographic signals

IP addresses can be mapped to approximate geographic locations:
  • Country: High accuracy
  • Region/State: Generally reliable
  • City: Moderate accuracy
  • Postal code: Less reliable
  • Street level: Not reliable
IP geolocation is useful for:
  • Regulatory compliance (geo-restrictions)
  • Regional targeting
  • Fraud detection (location mismatches)
IP geolocation is approximate and can be inaccurate, especially for mobile users and VPN users. Never rely on IP location for critical decisions requiring precise location.

Fraud and security signals

IP addresses provide useful signals for fraud detection:
  • Known proxy/VPN IP ranges
  • Data center vs. residential IPs
  • Bot traffic identification
  • Rate limiting and abuse detection

Privacy considerations

IP addresses are considered personal data under GDPR and may be subject to other privacy regulations:
  • They can identify households or organizations
  • Combined with other data, they may identify individuals
  • Collection requires appropriate legal basis

Working with IP addresses

Normalization

When matching IP data across sources:
  • Validate format (proper IPv4 or IPv6 structure)
  • Handle leading zeros consistently
  • Consider canonicalizing IPv6 addresses

Storage

FormatStorage SizeNotes
StringVariableHuman-readable, flexible
Integer (IPv4)4 bytesEfficient for IPv4 only
Binary (IPv6)16 bytesSupports both versions

Matching limitations

IP addresses are session-level identifiers, not user-level:
  • Dynamic IPs change over time
  • Shared IPs represent multiple users
  • Mobile IPs change as users move between networks
For user-level matching, IP addresses are typically combined with other signals or used as one factor in probabilistic matching.