Spaces:

RAVENOCC
/

Price_Comparison_Agent

Runtime error

App Files Files Community

jkanishkha0305 commited on Jun 10, 2025

Commit

1cb59e8

0 Parent(s):

Price Comparison Agent Huggingface Spaces

Browse files

Files changed (21) hide show

.DS_Store +0 -0
.gitattributes +35 -0
Dockerfile +21 -0
LICENSE +21 -0
README.md +138 -0
app.py +125 -0
requirements.txt +6 -0
src/__init__.py +0 -0
src/__pycache__/__init__.cpython-310.pyc +0 -0
src/__pycache__/agents.cpython-310.pyc +0 -0
src/__pycache__/config.cpython-310.pyc +0 -0
src/__pycache__/crew.cpython-310.pyc +0 -0
src/__pycache__/tasks.cpython-310.pyc +0 -0
src/__pycache__/tools.cpython-310.pyc +0 -0
src/agents.py +52 -0
src/config.py +23 -0
src/crew.py +14 -0
src/streamlit_app.py +125 -0
src/tasks.py +34 -0
src/tools.py +5 -0
test.ipynb +284 -0

.DS_Store ADDED Viewed

Binary file (6.15 kB). View file

.gitattributes ADDED Viewed

	@@ -0,0 +1,35 @@

+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text

Dockerfile ADDED Viewed

	@@ -0,0 +1,21 @@

+FROM python:3.9-slim
+WORKDIR /app
+RUN apt-get update && apt-get install -y \
+    build-essential \
+    curl \
+    software-properties-common \
+    git \
+    && rm -rf /var/lib/apt/lists/*
+COPY requirements.txt ./
+COPY src/ ./src/
+RUN pip3 install -r requirements.txt
+EXPOSE 8501
+HEALTHCHECK CMD curl --fail http://localhost:8501/_stcore/health
+ENTRYPOINT ["streamlit", "run", "src/streamlit_app.py", "--server.port=8501", "--server.address=0.0.0.0"]

LICENSE ADDED Viewed

	@@ -0,0 +1,21 @@

+MIT License
+Copyright (c) 2025 J.Kanishkha
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md ADDED Viewed

	@@ -0,0 +1,138 @@

+# 🛍️ **Price Comparison Agent** 🚀 using CrewAI and Cerebras 🤖
+Welcome to the **Price Comparison Agent** project! 🎉 This tool automatically compares prices for your favorite products across various e-commerce platforms. Powered by **CrewAI** for orchestration, **Cerebras LLM** for high-performance language processing, and cutting-edge scraping tools, it gives you insights like never before! 💡
+## 📚 Table of Contents
+- [🌟 Overview](#overview)
+- [🛠️ Technologies](#technologies)
+- [📂 Project Structure](#project-structure)
+- [⚙️ Installation](#installation)
+- [🔑 Setup Environment Variables](#setup-environment-variables)
+- [🚀 Usage](#usage)
+- [🚧 Future Improvements](#future-improvements)
+- [📝 License](#license)
+---
+## 🌟 Overview
+This project automates the **price comparison** process using an orchestrated system of agents and tasks built on **CrewAI**! 🎯 The agents work together to:
+1. **Collect pricing data** 🛒 from various e-commerce platforms.
+2. **Clean** 🧹 the data for consistency and accuracy.
+3. **Compare prices** 💲 to find the best deal.
+4. **Generate a detailed report** 📊 with actionable insights.
+### 🎯 Agents:
+- **Search Agent:** Scours the web for price data on your selected product.
+- **Data Cleaner:** Scrubs the collected data to ensure it’s clean and ready for analysis.
+- **Price Comparison Expert:** Identifies the lowest price and price-to-value ratio.
+- **Reporting Agent:** Summarizes all findings into a professional market insights report.
+### 🧰 Tools:
+- **SerperDevTool:** For scraping e-commerce platforms and gathering product details 🔍.
+- **ScrapeWebsiteTool:** For extracting additional data from specific product pages 🌐.
+### ⚡ Cerebras LLM:
+Powering our agents with state-of-the-art **language understanding** to generate insights! 🤖
+---
+## 🛠️ Technologies
+Here’s what makes the magic happen 🔮:
+- **CrewAI** 🧑‍💼: Orchestrating agents for intelligent automation.
+- **Cerebras LLM** 🧠: High-performance language model for processing and reporting.
+- **Streamlit** 🌐: Interactive web app for displaying results.
+- **SerperDevTool & ScrapeWebsiteTool** 🔧: Web scraping tools to collect data.
+- **Python** 🐍: The backbone of this project.
+---
+## Demo
+![main](assets/crewcerebras1.png)
+![main](assets/crewcerebras2.png)
+## 📂 Project Structure
+Here’s the folder breakdown 🗂️:
+```bash
+Price-Comparison-Agent/
+│
+├── src/
+│   ├── agents.py
+│   ├── config.py
+│   ├── crew.py
+│   ├── task.py
+│   ├── tools.py
+│
+├── app.py
+├── .env
+├── requirements.txt
+└── README.md
+```
+---
+## ⚙️ Installation
+Ready to get started? 🏁 Follow these simple steps to install and run the project:
+1. **Clone the Repository:**
+   ```bash
+   git clone https://github.com/Jkanishkha0305/Price-Comparison-Agent.git
+   cd Price-Comparison-Agent
+   ```
+2. **Install Dependencies:**
+Set up a virtual environment and install the required Python packages:
+    ```bash
+    python3 -m venv venv
+    source venv/bin/activate
+    pip install -r requirements.txt
+    ```
+3. **Setup Environment Variables:**
+    Create a .env file in the root directory. Add your API keys:
+    ```bash
+    CEREBRAS_API_KEY=your_cerebras_api_key
+    SERPER_API_KEY=your_serper_api_key
+    ```
+4. **Install Dependencies:**
+    Fire up the app using Streamlit:
+    ```bash
+    streamlit run app.py
+    ```
+    Go to http://localhost:8501 in your browser to start interacting with the tool! 🌐
+## 🚀 Usage
+Once you’re running the Streamlit app, simply:
+1. **Enter Product Name** 🏷️ (e.g., “Sony WH-1000XM5”).
+2. **Enter Country** 🌍 (e.g., “United States”).
+3. Click **"Compare Prices"** 💸.
+The app will:
+- **Search for prices** across major platforms 🛍️.
+- **Clean and standardize the data** 🧼.
+- **Compare the lowest prices** and show you the best deals 📊.
+- **Generate a detailed report** with pricing trends and recommendations 📑.
+## 🚧 Future Improvements
+Let’s make this tool even better! Here’s what we plan to add next:
+- **🌎 Multi-country Support**: Compare prices across different regions.
+- **🛍️ More Platforms**: Expand to include even more e-commerce platforms.
+- **🔄 Real-time Updates**: Keep prices up-to-date in real time.
+- **📱 Price Alerts**: Notify users when a product’s price drops below a certain threshold.
+## Teammates
+1. **Kanishkha Jaisankar**
+2. **Nirbhaya Reddy Gopavaram**
+3. **Nitharshan Coimbatore Venkatesan**
+## 📝 License
+This project is licensed under the MIT License. See the LICENSE file for details. ⚖️

app.py ADDED Viewed

	@@ -0,0 +1,125 @@

+import streamlit as st
+import sys
+import os
+import json
+from src.config import cerebras_llm
+from src.crew import create_crew  # Assuming you have a function that creates the crew setup
+# Ensure the src folder is part of the Python path
+sys.path.insert(0, os.path.abspath(os.path.join(os.path.dirname(__file__), 'src')))
+# Function to run the entire crew
+def run_crew(product_name, country, model_name):
+    # Create the crew with agents and tasks
+    event_management_crew = create_crew(product_name, country, model_name)
+    # Format the input for the crew
+    event_details = {'product': product_name, 'country': country, 'model': model_name}
+    # Execute Crew (this will run all the tasks)
+    event_analysis = event_management_crew.kickoff(inputs=event_details)
+    return event_analysis
+# Function to clean the output and remove unwanted fields
+def clean_output(output):
+    if isinstance(output, dict):
+        output = json.dumps(output, indent=4)
+        output = output.replace('"pydantic":null,', '')
+        output = output.replace('"json_dict":null,', '')
+        output = output.replace('"tasks_output":[]', '')
+        output = output.replace('"token_usage":', '')
+        return output
+    return output
+# Streamlit Page Config
+st.set_page_config(page_title="AI Price Comparator", page_icon="🛒", layout="wide")
+# Initialize session states for history and reports
+if 'history' not in st.session_state:
+    st.session_state.history = []
+if 'reports' not in st.session_state:
+    st.session_state.reports = {}
+# Sidebar for API Key Uploads, History, and Model Selection
+with st.sidebar:
+    st.header("🔑 **API Keys**")
+    cerebras_api_key = st.text_input("🧠 Cerebras API Key", type="password")
+    serper_api_key = st.text_input("🔍 Serper API Key", type="password")
+    # Model Selection
+    # Sidebar Model Selection
+    st.header("🧠 **Select Model**")
+    model_name = st.selectbox(
+        "Choose a Model",
+        ["cerebras/llama-3.1-8b", "cerebras/llama-3.3-70b", "cerebras/deepseek-r1-distill-llama-70b"]
+    )
+    # History Tab
+    st.header("📜 **Search History**")
+    if st.session_state.history:
+        for idx, search in enumerate(st.session_state.history):
+            if st.button(f"🔎 {search['product_name']} in {search['country']}", key=f"search_{idx}"):
+                st.session_state.selected_search = search  # Store selected search
+                st.rerun()
+    else:
+        st.write("No previous searches yet.")
+# **Main UI**
+st.markdown("## 🚀 **Welcome to the Price Comparison Tool!** 🛒")
+st.write("Enter the product details below to compare prices across multiple platforms. 📉")
+# **Inputs for product and country (always visible)**
+selected_search = st.session_state.get('selected_search', {})
+product_name = st.text_input(
+    "💡 **Product Name**",
+    selected_search.get('product_name', "Sony WH-1000XM5")
+)
+country = st.text_input(
+    "🌍 **Country**",
+    selected_search.get('country', "United States")
+)
+# **Button to compare prices**
+if st.button("🔍 **Compare Prices**", help="Click to analyze prices and get a detailed comparison"):
+    if product_name and country:
+        st.write(f"🛒 **Analyzing prices for** **{product_name}** in **{country}**... 📈")
+        # Run the crew and get the results
+        event_analysis = run_crew(product_name, country, model_name)
+        # Clean the output and display the results
+        cleaned_output = clean_output(event_analysis)
+        st.subheader("📊 **Price Comparison Report**")
+        st.markdown(cleaned_output)
+        # Store the search and report
+        search_key = f"{product_name}_{country}"
+        search_data = {'product_name': product_name, 'country': country, 'model_name': model_name}
+        if search_data not in st.session_state.history:
+            st.session_state.history.append(search_data)
+        st.session_state.reports[search_key] = cleaned_output  # Save the report
+        # Clear selected search after displaying results
+        st.session_state.selected_search = {'product_name': product_name, 'country': country}
+    else:
+        st.error("❌ Please enter both product name and country.")
+# **Display saved report if a past search is selected**
+search_key = f"{product_name}_{country}"
+if search_key in st.session_state.reports:
+    st.subheader("📊 **Saved Price Comparison Report**")
+    st.markdown(st.session_state.reports[search_key])
+    # **Download Button for the Report**
+    # report_json = st.session_state.reports[search_key].encode('utf-8')
+    # st.download_button(
+    #     label="📥 Download Report",
+    #     data=report_json,
+    #     file_name=f"{search_key}.json",
+    #     mime="application/json"
+    # )

requirements.txt ADDED Viewed

	@@ -0,0 +1,6 @@

+crewai
+streamlit
+langchain-core
+langchain
+langchain-cerebras

src/__init__.py ADDED Viewed

File without changes

src/__pycache__/__init__.cpython-310.pyc ADDED Viewed

Binary file (150 Bytes). View file

src/__pycache__/agents.cpython-310.pyc ADDED Viewed

Binary file (1.98 kB). View file

src/__pycache__/config.cpython-310.pyc ADDED Viewed

Binary file (675 Bytes). View file

src/__pycache__/crew.cpython-310.pyc ADDED Viewed

Binary file (604 Bytes). View file

src/__pycache__/tasks.cpython-310.pyc ADDED Viewed

Binary file (2.11 kB). View file

src/__pycache__/tools.cpython-310.pyc ADDED Viewed

Binary file (267 Bytes). View file

src/agents.py ADDED Viewed

	@@ -0,0 +1,52 @@

+from crewai import Agent
+from src.config import cerebras_llm
+from src.tools import search_tool, scrape_tool
+def create_agents(product_name, country, model_name):
+    search = Agent(
+        role="E-Commerce Market Research Analyst",
+        goal=f"Provide up-to-date market analysis of {product_name} from e-commerce platforms in {country}. Model: {model_name}",
+        backstory="An expert analyst with a keen eye for market trends",
+        tools=[search_tool, scrape_tool],
+        verbose=True,
+        llm=cerebras_llm
+    )
+    data_cleaner = Agent(
+        role="Data Cleaning Specialist",
+        goal=f"Ensure all price values for {product_name} are accurate, properly formatted, and free of inconsistencies.",
+        backstory=(
+            "An experienced data analyst with a strong background in data preprocessing, "
+            "error detection, and price standardization. With expertise in handling messy datasets, "
+            "you identify and clean incorrect, missing, or inconsistent price values, ensuring the data is reliable for further analysis."
+        ),
+        tools=[],
+        verbose=True,
+        llm=cerebras_llm
+    )
+    comparison = Agent(
+        role="Price Comparison Expert",
+        goal=f"Analyze and compare {product_name} prices to identify the lowest price available.",
+        backstory=(
+            "A meticulous price analyst with expertise in comparing product prices across different sources. "
+            "You efficiently process pricing data, highlight discrepancies, and determine the best deal for consumers."
+        ),
+        tools=[],
+        verbose=True,
+        llm=cerebras_llm
+    )
+    reporting_agent = Agent(
+        role="Market Insights Reporter",
+        goal=f"Generate a comprehensive report summarizing price trends, differences, and the best available deals for {product_name}.",
+        backstory=(
+            "A skilled data journalist with experience in analyzing pricing trends and market fluctuations. "
+            "You transform raw pricing data into insightful reports, providing actionable insights on cost-effective options."
+        ),
+        tools=[],
+        verbose=True,
+        llm=cerebras_llm
+    )
+    return search, data_cleaner, comparison, reporting_agent

src/config.py ADDED Viewed

	@@ -0,0 +1,23 @@

+import os
+from dotenv import load_dotenv
+from crewai import LLM
+load_dotenv()
+# Load API keys from environment variables
+CEREBRAS_API_KEY = os.getenv("CEREBRAS_API_KEY")
+SERPER_API_KEY = os.getenv("SERPER_API_KEY")
+if not CEREBRAS_API_KEY:
+    raise ValueError("Missing Cerebras API Key! Set CEREBRAS_API_KEY in environment variables.")
+if not SERPER_API_KEY:
+    raise ValueError("Missing Serper API Key! Set SERPER_API_KEY in environment variables.")
+cerebras_llm = LLM(
+    model="cerebras/llama-3.3-70b",
+    temperature=0.7,
+    max_tokens=18192,
+    api_key=CEREBRAS_API_KEY,
+    base_url="https://api.cerebras.ai/v1",
+)

src/crew.py ADDED Viewed

	@@ -0,0 +1,14 @@

+from src.tasks import create_tasks
+from crewai import Crew
+def create_crew(product_name, country, model_name):
+    # Create agents and tasks using the create_tasks function
+    search, data_cleaner, comparison, reporting_agent, search_task, cleaning_task, comparison_task, reporting_task = create_tasks(product_name, country, model_name)
+    # Define the crew (agents and tasks)
+    event_management_crew = Crew(
+        agents=[search, data_cleaner, comparison, reporting_agent],
+        tasks=[search_task, cleaning_task, comparison_task, reporting_task],
+        verbose=True,
+    )
+    return event_management_crew

src/streamlit_app.py ADDED Viewed

	@@ -0,0 +1,125 @@

+import streamlit as st
+import sys
+import os
+import json
+from src.config import cerebras_llm
+from src.crew import create_crew  # Assuming you have a function that creates the crew setup
+# Ensure the src folder is part of the Python path
+sys.path.insert(0, os.path.abspath(os.path.join(os.path.dirname(__file__), 'src')))
+# Function to run the entire crew
+def run_crew(product_name, country, model_name):
+    # Create the crew with agents and tasks
+    event_management_crew = create_crew(product_name, country, model_name)
+    # Format the input for the crew
+    event_details = {'product': product_name, 'country': country, 'model': model_name}
+    # Execute Crew (this will run all the tasks)
+    event_analysis = event_management_crew.kickoff(inputs=event_details)
+    return event_analysis
+# Function to clean the output and remove unwanted fields
+def clean_output(output):
+    if isinstance(output, dict):
+        output = json.dumps(output, indent=4)
+        output = output.replace('"pydantic":null,', '')
+        output = output.replace('"json_dict":null,', '')
+        output = output.replace('"tasks_output":[]', '')
+        output = output.replace('"token_usage":', '')
+        return output
+    return output
+# Streamlit Page Config
+st.set_page_config(page_title="AI Price Comparator", page_icon="🛒", layout="wide")
+# Initialize session states for history and reports
+if 'history' not in st.session_state:
+    st.session_state.history = []
+if 'reports' not in st.session_state:
+    st.session_state.reports = {}
+# Sidebar for API Key Uploads, History, and Model Selection
+with st.sidebar:
+    st.header("🔑 **API Keys**")
+    cerebras_api_key = st.text_input("🧠 Cerebras API Key", type="password")
+    serper_api_key = st.text_input("🔍 Serper API Key", type="password")
+    # Model Selection
+    # Sidebar Model Selection
+    st.header("🧠 **Select Model**")
+    model_name = st.selectbox(
+        "Choose a Model",
+        ["cerebras/llama-3.1-8b", "cerebras/llama-3.3-70b", "cerebras/deepseek-r1-distill-llama-70b"]
+    )
+    # History Tab
+    st.header("📜 **Search History**")
+    if st.session_state.history:
+        for idx, search in enumerate(st.session_state.history):
+            if st.button(f"🔎 {search['product_name']} in {search['country']}", key=f"search_{idx}"):
+                st.session_state.selected_search = search  # Store selected search
+                st.rerun()
+    else:
+        st.write("No previous searches yet.")
+# **Main UI**
+st.markdown("## 🚀 **Welcome to the Price Comparison Tool!** 🛒")
+st.write("Enter the product details below to compare prices across multiple platforms. 📉")
+# **Inputs for product and country (always visible)**
+selected_search = st.session_state.get('selected_search', {})
+product_name = st.text_input(
+    "💡 **Product Name**",
+    selected_search.get('product_name', "Sony WH-1000XM5")
+)
+country = st.text_input(
+    "🌍 **Country**",
+    selected_search.get('country', "United States")
+)
+# **Button to compare prices**
+if st.button("🔍 **Compare Prices**", help="Click to analyze prices and get a detailed comparison"):
+    if product_name and country:
+        st.write(f"🛒 **Analyzing prices for** **{product_name}** in **{country}**... 📈")
+        # Run the crew and get the results
+        event_analysis = run_crew(product_name, country, model_name)
+        # Clean the output and display the results
+        cleaned_output = clean_output(event_analysis)
+        st.subheader("📊 **Price Comparison Report**")
+        st.markdown(cleaned_output)
+        # Store the search and report
+        search_key = f"{product_name}_{country}"
+        search_data = {'product_name': product_name, 'country': country, 'model_name': model_name}
+        if search_data not in st.session_state.history:
+            st.session_state.history.append(search_data)
+        st.session_state.reports[search_key] = cleaned_output  # Save the report
+        # Clear selected search after displaying results
+        st.session_state.selected_search = {'product_name': product_name, 'country': country}
+    else:
+        st.error("❌ Please enter both product name and country.")
+# **Display saved report if a past search is selected**
+search_key = f"{product_name}_{country}"
+if search_key in st.session_state.reports:
+    st.subheader("📊 **Saved Price Comparison Report**")
+    st.markdown(st.session_state.reports[search_key])
+    # **Download Button for the Report**
+    # report_json = st.session_state.reports[search_key].encode('utf-8')
+    # st.download_button(
+    #     label="📥 Download Report",
+    #     data=report_json,
+    #     file_name=f"{search_key}.json",
+    #     mime="application/json"
+    # )

src/tasks.py ADDED Viewed

	@@ -0,0 +1,34 @@

+from crewai import Task
+from src.agents import create_agents
+def create_tasks(product_name, country, model_name):
+    # Get the agents
+    search, data_cleaner, comparison, reporting_agent = create_agents(product_name, country, model_name)
+    # Task definitions
+    search_task = Task(
+        description=f"Collect current pricing data for {product_name} from at least 3 major e-commerce platforms in {country}. Include product name, model, specifications, price, and any ongoing promotions or discounts.",
+        expected_output=f"A structured dataset containing {product_name} information and pricing from multiple sources, with complete pricing details.",
+        agent=search
+    )
+    cleaning_task = Task(
+        description=f"Process the raw pricing data for {product_name} to standardize formats, handle currency conversions, remove outliers, and identify any inconsistencies or errors in the collected price information.",
+        expected_output=f"A cleaned dataset with uniformly formatted prices for {product_name}, standardized currencies, and annotations for any identified anomalies or special pricing conditions.",
+        agent=data_cleaner
+    )
+    comparison_task = Task(
+        description=f"Analyze the cleaned pricing data to identify the lowest available price for {product_name}, calculate price differences between retailers, and determine price-to-value ratios based on product specifications.",
+        expected_output=f"A comparative analysis showing price rankings for {product_name}, percentage differences between retailers, and identification of the best value options across different price points.",
+        agent=comparison
+    )
+    reporting_task = Task(
+        description=f"Create a comprehensive market insights report based on the {product_name} pricing analysis, highlighting best deals, pricing trends, and actionable recommendations for price-conscious consumers.",
+        expected_output=f"A detailed report for {product_name} with executive summary, visualizations of price comparisons, identification of pricing patterns, and specific recommendations for optimal purchasing decisions.",
+        agent=reporting_agent
+    )
+    # Return both agents and tasks
+    return search, data_cleaner, comparison, reporting_agent, search_task, cleaning_task, comparison_task, reporting_task

src/tools.py ADDED Viewed

	@@ -0,0 +1,5 @@

+from crewai_tools import ScrapeWebsiteTool, SerperDevTool
+# Initialize tools
+search_tool = SerperDevTool()
+scrape_tool = ScrapeWebsiteTool()

test.ipynb ADDED Viewed

	@@ -0,0 +1,284 @@

+{
+ "cells": [
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "/opt/anaconda3/envs/agents/lib/python3.10/site-packages/pydantic/_internal/_config.py:295: PydanticDeprecatedSince20: Support for class-based `config` is deprecated, use ConfigDict instead. Deprecated in Pydantic V2.0 to be removed in V3.0. See Pydantic V2 Migration Guide at https://errors.pydantic.dev/2.10/migration/\n",
+      "  warnings.warn(DEPRECATION_MESSAGE, DeprecationWarning)\n",
+      "/opt/anaconda3/envs/agents/lib/python3.10/site-packages/pydantic/_internal/_generate_schema.py:502: UserWarning: <built-in function callable> is not a Python type (it may be an instance of an object), Pydantic will allow any object with no validation since we cannot even enforce that the input is an instance of the given type. To get rid of this error wrap the type with `pydantic.SkipValidation`.\n",
+      "  warn(\n",
+      "/opt/anaconda3/envs/agents/lib/python3.10/site-packages/crewai_tools/tools/scrapegraph_scrape_tool/scrapegraph_scrape_tool.py:34: PydanticDeprecatedSince20: Pydantic V1 style `@validator` validators are deprecated. You should migrate to Pydantic V2 style `@field_validator` validators, see the migration guide for more details. Deprecated in Pydantic V2.0 to be removed in V3.0. See Pydantic V2 Migration Guide at https://errors.pydantic.dev/2.10/migration/\n",
+      "  @validator(\"website_url\")\n",
+      "/opt/anaconda3/envs/agents/lib/python3.10/site-packages/crewai_tools/tools/selenium_scraping_tool/selenium_scraping_tool.py:26: PydanticDeprecatedSince20: Pydantic V1 style `@validator` validators are deprecated. You should migrate to Pydantic V2 style `@field_validator` validators, see the migration guide for more details. Deprecated in Pydantic V2.0 to be removed in V3.0. See Pydantic V2 Migration Guide at https://errors.pydantic.dev/2.10/migration/\n",
+      "  @validator(\"website_url\")\n",
+      "/opt/anaconda3/envs/agents/lib/python3.10/site-packages/crewai_tools/tools/vision_tool/vision_tool.py:15: PydanticDeprecatedSince20: Pydantic V1 style `@validator` validators are deprecated. You should migrate to Pydantic V2 style `@field_validator` validators, see the migration guide for more details. Deprecated in Pydantic V2.0 to be removed in V3.0. See Pydantic V2 Migration Guide at https://errors.pydantic.dev/2.10/migration/\n",
+      "  @validator(\"image_path_url\")\n"
+     ]
+    }
+   ],
+   "source": [
+    "import os\n",
+    "from crewai import Agent, Crew, Task, LLM, Process\n",
+    "from crewai_tools import ScrapeWebsiteTool, SerperDevTool\n",
+    "from dotenv import load_dotenv"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "load_dotenv()\n",
+    "CEREBRAS_API_KEY = os.getenv(\"CEREBRAS_API_KEY\")\n",
+    "SERPER_API_KEY = os.getenv(\"SERPER_API_KEY\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "if not CEREBRAS_API_KEY:\n",
+    "    raise ValueError(\"Missing Cerebras API Key! Set CEREBRAS_API_KEY in environment variables.\")\n",
+    "\n",
+    "if not SERPER_API_KEY:\n",
+    "    raise ValueError(\"Missing Serper API Key! Set SERPER_API_KEY in environment variables.\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "cerebras_llm = LLM(\n",
+    "    model=\"cerebras/llama-3.3-70b\",\n",
+    "    temperature=0.7,\n",
+    "    max_tokens=8192,\n",
+    "    api_key=CEREBRAS_API_KEY,\n",
+    "    base_url=\"https://api.cerebras.ai/v1\",\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Tools"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "search_tool = SerperDevTool()\n",
+    "scrape_tool = ScrapeWebsiteTool()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Agents"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 19,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "search = Agent(\n",
+    "        role=\"E-Commerce Market Research Analyst\",\n",
+    "        goal=f\"Provide up-to-date market analysis of {product_name} from E-commerce Industry\",\n",
+    "        backstory=\"An expert analyst with a keen eye for market trends\",\n",
+    "        tools=[search_tool, scrape_tool],\n",
+    "        verbose=True,\n",
+    "        llm=cerebras_llm\n",
+    "    )\n",
+    "\n",
+    "data_cleaner = Agent(\n",
+    "    role=\"Data Cleaning Specialist\",\n",
+    "    goal=f\"Ensure all price values for {product_name} are accurate, properly formatted, and free of inconsistencies.\",\n",
+    "    backstory=(\n",
+    "        \"An experienced data analyst with a strong background in data preprocessing, \"\n",
+    "        \"error detection, and price standardization. With expertise in handling messy datasets, \"\n",
+    "        \"you identify and clean incorrect, missing, or inconsistent price values, ensuring the data is reliable for further analysis.\"\n",
+    "    ),\n",
+    "    tools=[],\n",
+    "    verbose=True,\n",
+    "    llm=cerebras_llm\n",
+    ")\n",
+    "\n",
+    "comparison = Agent(\n",
+    "    role=\"Price Comparison Expert\",\n",
+    "    goal=f\"Analyze and compare {product_name} prices to identify the lowest price available.\",\n",
+    "    backstory=(\n",
+    "        \"A meticulous price analyst with expertise in comparing product prices across different sources. \"\n",
+    "        \"You efficiently process pricing data, highlight discrepancies, and determine the best deal for consumers.\"\n",
+    "    ),\n",
+    "    tools=[],\n",
+    "    verbose=True,\n",
+    "    llm=cerebras_llm\n",
+    ")\n",
+    "\n",
+    "reporting_agent = Agent(\n",
+    "    role=\"Market Insights Reporter\",\n",
+    "    goal=f\"Generate a comprehensive report summarizing price trends, differences, and the best available deals for {product_name}.\",\n",
+    "    backstory=(\n",
+    "        \"A skilled data journalist with experience in analyzing pricing trends and market fluctuations. \"\n",
+    "        \"You transform raw pricing data into insightful reports, providing actionable insights on cost-effective options.\"\n",
+    "    ),\n",
+    "    tools=[],\n",
+    "    verbose=True,\n",
+    "    llm=cerebras_llm\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Tasks "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 20,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "search_task = Task(\n",
+    "    description=f\"Collect current pricing data for {product_name} from at least 3 major e-commerce platforms. Include product name, model, specifications, price, and any ongoing promotions or discounts.\",\n",
+    "    expected_output=f\"A structured dataset containing {product_name} information and pricing from multiple sources, with complete pricing details.\",\n",
+    "    agent=search\n",
+    ")\n",
+    "\n",
+    "cleaning_task = Task(\n",
+    "    description=f\"Process the raw pricing data for {product_name} to standardize formats, handle currency conversions, remove outliers, and identify any inconsistencies or errors in the collected price information.\",\n",
+    "    expected_output=f\"A cleaned dataset with uniformly formatted prices for {product_name}, standardized currencies, and annotations for any identified anomalies or special pricing conditions.\",\n",
+    "    agent=data_cleaner\n",
+    ")\n",
+    "\n",
+    "comparison_task = Task(\n",
+    "    description=f\"Analyze the cleaned pricing data to identify the lowest available price for {product_name}, calculate price differences between retailers, and determine price-to-value ratios based on product specifications.\",\n",
+    "    expected_output=f\"A comparative analysis showing price rankings for {product_name}, percentage differences between retailers, and identification of the best value options across different price points.\",\n",
+    "    agent=comparison\n",
+    ")\n",
+    "\n",
+    "reporting_task = Task(\n",
+    "    description=f\"Create a comprehensive market insights report based on the {product_name} pricing analysis, highlighting best deals, pricing trends, and actionable recommendations for price-conscious consumers.\",\n",
+    "    expected_output=f\"A detailed report for {product_name} with executive summary, visualizations of price comparisons, identification of pricing patterns, and specific recommendations for optimal purchasing decisions.\",\n",
+    "    agent=reporting_agent\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 21,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "2025-02-25 03:19:07,616 - 8480639040 - __init__.py-__init__:537 - WARNING: Overriding of current TracerProvider is not allowed\n"
+     ]
+    }
+   ],
+   "source": [
+    "product_price_crew = Crew(\n",
+    "    agents=[search, data_cleaner, comparison, reporting_agent],\n",
+    "    tasks=[search_task, cleaning_task, comparison_task, reporting_task], \n",
+    "    verbose=True,\n",
+    "    process=Process.sequential,\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 25,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "product_name = \"Sony WH-1000XM5\"\n",
+    "country = \"United States\"\n",
+    "# format = {'product': product_name, 'country': country}\n",
+    "format = {'product': product_name}"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 23,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "product_name = \"Lenovo Earbuds LP40\"\n",
+    "country = \"United States\"\n",
+    "format = {'product': product_name, 'country': country}"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [
+    {
+     "ename": "NameError",
+     "evalue": "name 'product_price_crew' is not defined",
+     "output_type": "error",
+     "traceback": [
+      "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
+      "\u001b[0;31mNameError\u001b[0m                                 Traceback (most recent call last)",
+      "Cell \u001b[0;32mIn[1], line 2\u001b[0m\n\u001b[1;32m      1\u001b[0m \u001b[38;5;66;03m# Execute Crew\u001b[39;00m\n\u001b[0;32m----> 2\u001b[0m event_analysis \u001b[38;5;241m=\u001b[39m \u001b[43mproduct_price_crew\u001b[49m\u001b[38;5;241m.\u001b[39mkickoff(inputs\u001b[38;5;241m=\u001b[39m\u001b[38;5;28mformat\u001b[39m)\n\u001b[1;32m      3\u001b[0m \u001b[38;5;66;03m# Print the final report\u001b[39;00m\n\u001b[1;32m      4\u001b[0m \u001b[38;5;28mprint\u001b[39m(event_analysis)\n",
+      "\u001b[0;31mNameError\u001b[0m: name 'product_price_crew' is not defined"
+     ]
+    }
+   ],
+   "source": [
+    "# Execute Crew\n",
+    "event_analysis = product_price_crew.kickoff(inputs=format)\n",
+    "# Print the final report\n",
+    "print(event_analysis)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": []
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "agents",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.16"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}