Anthropic recently conducted an internal test exploring the potential of automated commerce. The initiative featured a digital classifieds platform where software agents acted on behalf of human participants to negotiate and execute transactions involving actual products and currency.
The initiative, internally dubbed Project Deal, operated as a preliminary trial involving 69 staff members who volunteered to participate. Each individual received a $100 allowance, distributed through gift cards, to purchase items from their colleagues. Despite the limited scope of the trial, the organization noted the system functioned surprisingly well. Throughout the test, the automated representatives successfully completed 186 transactions, generating a combined exchange value exceeding $4,000.
Marketplace Variations and Performance
Researchers structured the evaluation around four distinct trading environments to analyze different variables. One of these environments functioned as the primary live market, utilizing the most sophisticated artificial intelligence model available to the firm. In this specific setup, all negotiated agreements were officially honored once the trial concluded. The remaining three environments were established purely for observational research.
The data revealed a clear advantage for participants utilizing more capable software. According to the findings, superior models consistently secured measurably better results for their human clients. However, an interesting behavioral observation emerged: participants generally remained unaware of this performance disparity. This dynamic highlights a potential future issue regarding software quality discrepancies, where individuals utilizing inferior automated assistants might suffer financial disadvantages without ever realizing they negotiated a poor deal.
Furthermore, the research indicated that the baseline prompts provided to the automated assistants had little to no measurable impact on the final outcomes. Whether examining the probability of a successful transaction or the final agreed-upon price, the initial directives given to the software did not significantly alter the behavior of the trading programs.



