First_agent_template

sayed99 committed on Feb 18

Commit

e104f68

1 Parent(s): 48d2716

Taught the agent to generate and caption images (connect things together)

Files changed (2)

prompts.yaml CHANGED

@@ -21,15 +21,32 @@
   ```<end_code>
   Observation: "The oldest person in the document is John Doe, a 55 year old lumberjack living in Newfoundland."
-  Thought: I will now generate an image showcasing the oldest person and save it in generations dir.
   Code:
   ```py
-  import uuid
   image, img_path = image_generator("A portrait of John Doe, a 55-year-old man living in Canada.")
   final_answer(image)
-  final_answer(img_path)
   ```<end_code>
   ---
   Task: "What is the result of the following operation: 5 + 3 + 1294.678?"

   ```<end_code>
   Observation: "The oldest person in the document is John Doe, a 55 year old lumberjack living in Newfoundland."
+  Thought: I will now generate an image showcasing the oldest person and it will be saved in generations dir.
   Code:
   ```py
   image, img_path = image_generator("A portrait of John Doe, a 55-year-old man living in Canada.")
+  print("The image has been generated and saved in ", img_path)
   final_answer(image)
   ```<end_code>
+  ---
+  Task: "Generate image for a person walking on the moon, and then caption it"
+  Thought: First I will use the tool `image_generator` to generate an image of a person walking on the moon.
+  Code:
+  ```py
+  image, img_path = image_generator("A person walking on the moon.")
+  print("The image has been generated and saved in ", img_path)
+  final_answer(image)
+  ```<end_code>
+  Observation: "The image has been generated and saved in the 'generations' directory."
+  Thought: Now I will use the tool `image_captioning` to caption the generated image.
+  Code:
+  ```py
+  caption = image_captioning(image)
+  final_answer(caption)
+  ```<end_code>
   ---
   Task: "What is the result of the following operation: 5 + 3 + 1294.678?"
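
Review note on the new two-step example: its first code block calls `final_answer(image)` before the captioning step. As far as I understand smolagents, `final_answer` ends the run, so an agent imitating this example could stop before it ever captions the image. Below is a minimal sketch of the flow the example seems to intend. Everything in it is a stand-in for illustration: `FinalAnswer`, `run_steps`, and the stub tools are hypothetical and are not the smolagents API.

```python
# Stand-in sketch: why final_answer belongs only in the last step.
# All names below are hypothetical stubs, not the real smolagents API.

class FinalAnswer(Exception):
    """Raised by the stub final_answer to stop the run."""

    def __init__(self, value):
        super().__init__("final answer reached")
        self.value = value


def final_answer(value):
    # Assumption: a final answer terminates the whole run.
    raise FinalAnswer(value)


def image_generator(prompt):
    # Stub tool: returns a fake "image" and a fake path string.
    return {"prompt": prompt}, "generations/example.png"


def image_captioning(image):
    # Stub tool: builds a caption from the fake image.
    return f"A caption for: {image['prompt']}"


def run_steps(steps):
    """Run each step in order with shared state; stop at the first final answer."""
    state = {}
    for step in steps:
        try:
            step(state)
        except FinalAnswer as done:
            return done.value
    return None


def generate(state):
    # Step 1: generate and report, but do NOT call final_answer yet.
    image, img_path = image_generator("A person walking on the moon.")
    print("The image has been generated and saved in", img_path)
    state["image"] = image


def generate_then_finish(state):
    # Step 1 as written in the diff: the run ends here.
    image, _ = image_generator("A person walking on the moon.")
    final_answer(image)


def caption_image(state):
    # Step 2: caption the image kept from step 1, then finish.
    final_answer(image_captioning(state["image"]))


caption = run_steps([generate, caption_image])
early = run_steps([generate_then_finish, caption_image])
print(caption)
print(early)
```

In the sketch, the run that defers `final_answer` reaches the caption, while the run that calls it in step 1 returns the image and never captions it. The `Observation` line in the example would also be more faithful if it echoed the printed message from step 1.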

tools/image_generation.py CHANGED

@@ -11,15 +11,12 @@ from PIL import Image
 class ImageGenerationTool(Tool):
     name = "image_generation"
     description = """
-    Generates an image based on the given prompt and saves it locally.
     Args:
         prompt (str): The prompt for image generation.
     Returns:
-        tuple: A tuple containing:
-            - Image.Image: The generated image.
-            - pathlib.Path: The file path where the image was saved.
     """
     inputs = {'prompt': {'type': 'string',
                          'description': 'The prompt for image generation.'}}

 class ImageGenerationTool(Tool):
     name = "image_generation"
     description = """
+    Generates an image based on the given prompt and saves it locally at generations dir.
     Args:
         prompt (str): The prompt for image generation.
     Returns:
+        - Image.Image: The generated image.
+        - str: The path where the image is saved.
     """
     inputs = {'prompt': {'type': 'string',
                          'description': 'The prompt for image generation.'}}
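
Review note: the updated docstring says the tool returns the image plus a `str` path inside the `generations` directory. The body of `forward` is not visible in this hunk, so the following is only a sketch of that return contract under stated assumptions. The helper `save_generated_image`, the `uuid`-based file name, and the placeholder bytes are all hypothetical. Also note that the prompt examples call `image_generator` while the class declares `name = "image_generation"`; it may be worth checking that the name the agent sees matches the one used in the prompts.

```python
# Sketch of the (image, path-as-str) return contract described in the docstring.
# Assumptions: uuid-based file names and raw bytes standing in for a PIL image.
import tempfile
import uuid
from pathlib import Path


def save_generated_image(image_bytes: bytes, out_dir: str = "generations") -> str:
    """Write the image bytes into out_dir and return the path as a plain string."""
    directory = Path(out_dir)
    directory.mkdir(parents=True, exist_ok=True)
    path = directory / f"{uuid.uuid4().hex}.png"
    path.write_bytes(image_bytes)
    return str(path)


def image_generator(prompt: str, out_dir: str = "generations"):
    """Hypothetical stand-in for the tool: returns (image, saved path)."""
    image_bytes = prompt.encode("utf-8")  # placeholder, not a real image
    return image_bytes, save_generated_image(image_bytes, out_dir)


# Usage: write into a temporary directory so the demo leaves nothing behind.
with tempfile.TemporaryDirectory() as tmp:
    image, img_path = image_generator("A person walking on the moon.", tmp)
    print("The image has been generated and saved in", img_path)
```

Returning the path as a `str` rather than a `pathlib.Path` keeps the value easy to print and pass between agent steps, which matches the `print(..., img_path)` line added to the prompt examples.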