Your ultimate guide

Showing posts with label Selenium. Show all posts
Showing posts with label Selenium. Show all posts

Automation Frameworks

Frameworks in Test Automation:
Utilizing a framework facilitates the structured organization of code, providing clear guidelines and best practices to methodically achieve desired outcomes.

Types of Automation Frameworks:
  1. Linear Automation Framework
  2. Modular Driven Framework
  3. Data-Driven Framework
  4. Keyword-Driven Framework
  5. Hybrid Testing Framework
  6. Behavior-Driven Development (BDD) Framework
Let's discuss each of these in detail.




1. Linear Automation Framework
    The Linear Automation Framework, often referred to as the Record & Playback approach, is characterized by its simplicity and ease of implementation. Unlike more complex frameworks, such as Data-Driven or Keyword-Driven frameworks, this approach involves the creation of test scripts within a single class or script file.

How Does It Work?
    Testers record their interactions with the application under test using specialized tools. These interactions are then converted into script commands stored in a single script file. During playback, the recorded actions are executed sequentially, mimicking user interactions.

Advantages:
  • Quick Script Generation – Testers can create scripts quickly by recording their actions without extensive scripting knowledge.
  • Low Barrier to Entry – Beginners can start automating tests with minimal training.
  • Tool Familiarization – Helps testers become familiar with automation tools.

Disadvantages:
  • Limited Reusability – Scripts are linear and cannot be easily reused across multiple tests.
  • Hardcoded Test Data – Test data is embedded within scripts, making updates challenging.
  • Maintenance Challenges – As the test suite grows, maintaining scripts becomes cumbersome.




2. Modular Driven Framework
    The Modular Driven Framework breaks down the application into smaller, independent parts (modules). Each module is tested separately using dedicated scripts, improving reusability and organization.

Advantages:
  • Enhanced Reusability – Test scripts can be reused across different scenarios.
  • Targeted Maintenance – Changes in an application require only specific modules to be updated.
  • Cost-Efficient Maintenance – Clearly separated modules simplify updates and maintenance.
Challenges:
  • Requires Technical Expertise – Knowledge of programming and testing concepts is needed.
  • Time-Consuming Setup – Compared to linear automation, this approach requires careful planning.
  • Hardcoded Test Data – Like the Linear Framework, test data may be embedded in scripts.




3. Data-Driven Testing Framework
    This framework builds upon the Modular Driven Framework by allowing test data to be dynamically supplied from external sources, such as:
  • Excel files
  • CSV files
  • Databases
  • Text files
    Instead of embedding data within scripts, test scripts retrieve data from these external files during execution.

When to Use:
    When testing the same functionality multiple times with different sets of input data.

Advantages:
  • Separation of Test Data and Test Scripts – Updating test data does not affect scripts.
  • Improved Test Coverage – A single test script can be reused with multiple datasets.
  • Flexibility & Maintainability – Test data can be updated independently.
Disadvantages:
  • Increased Complexity – Setting up test data sources requires extra effort.
  • Requires Programming Skills – Testers must understand scripting and data handling.
  • Time-Consuming Setup – Requires significant planning and preparation.




4. Keyword-Driven Testing Framework
    This framework uses keywords stored in an external data file to drive test execution. These keywords represent specific actions to be performed, making the framework structured and reusable.

Advantages:
  • No Programming Skills Required – Testers can create tests using predefined keywords.
  • Simplified Script Creation – Once set up, writing new tests is easier.
  • High Code Reusability – Keywords can be used across multiple scripts.
  • Tool Independence – Works with various test automation tools.
  • Application Independence – Test cases can be designed even before development is complete.
Disadvantages:
  • High Initial Setup Effort – Requires significant time and automation expertise.
  • Higher Initial Investment – More suited for large applications.
  • Complex Maintenance – Managing keywords can become challenging as the framework grows.
  • Longer Learning Curve – Testers must learn how to use and manage keywords effectively.




5. Hybrid Testing Framework
    Hybrid Testing Framework combines features of multiple frameworks, such as Keyword-Driven and Data-Driven approaches, to enhance flexibility and scalability.

Key Components:
  • Driver Script – Controls test execution flow.
  • Configuration File – Stores settings and environment variables.
  • Run Manager – Determines which tests to run.
  • Test Cases & Scripts – Defines automation scenarios.
  • Object Repository – Stores UI elements for easy identification.
  • Test Data – Externalized data sources (Excel, CSV, Databases).
  • Results & Logs – Stores test execution reports.
  • Function Library – Collection of reusable functions.
  • Scheduler – Automates test execution at predefined intervals.
Advantages:
  • Best of Multiple Frameworks – Combines strengths of Keyword-Driven and Data-Driven frameworks.
  • High Flexibility & Reusability – Adaptable to different testing needs.
  • Faster Execution – Optimized test execution time.
  • Scalability – Supports multi-platform and cross-browser testing.
Disadvantages:
  • Requires Strong Technical Expertise – Developers must understand multiple frameworks.
  • Higher Initial Effort – Requires significant setup time and investment.




6. Behavior-Driven Development (BDD) Framework
    BDD enables teams to create test cases in plain, human-readable language (e.g., English). This improves collaboration between testers, developers, and business analysts.

Given-When-Then Approach:
BDD follows a structured format:
  • Given – Defines initial conditions.
  • When – Describes user actions.
  • Then – Specifies expected outcomes.
Example Feature File (Cucumber - Gherkin Format): gherkin

Feature: BDD implementation using Cucumber  
Scenario: Login to Gmail using Cucumber plugin  
  Given User is navigating to the Gmail login page  
  When User enters username as "Username" and password as "Password"  
  Then User is successfully navigated to the Gmail mailbox

Advantages:
  • Improved Test Coverage – Ensures all user stories are tested.
  • Readable Scenarios – Uses natural language for easy understanding.
  • Test Automation Friendly – Works with Cucumber, SpecFlow, and JBehave.
  • Code Reusability – Steps can be reused across test cases.
  • CI/CD Integration – Easily fits into Continuous Integration pipelines.

Disadvantages:
  • Requires Collaboration – Close coordination between developers and testers is needed.
  • Risk of Misuse – If not used correctly, BDD can turn into an inefficient testing approach.

Conclusion:
Selecting the right automation framework depends on project requirements, team expertise, and long-term maintainability. Linear and Modular frameworks are best for small projects, while Data-Driven, Keyword-Driven, Hybrid, and BDD frameworks are ideal for large-scale automation efforts.

File Upload Using Selenium WebDriver

Exploring Different Approaches for File Upload with Selenium WebDriver:

........Incomplete.......
    File uploads are a common functionality on web applications, and automating this process is crucial for efficient testing with Selenium WebDriver. In this blog post, we will explore three different methods to upload files using Selenium WebDriver: 'SendKeys', the 'Robot' class, and 'AutoIT'.

Using SendKeys:
    The most straightforward approach for uploading a file in Selenium WebDriver is to use the 'SendKeys' method. This method is applicable only if the HTML input element has the attribute type="file". This attribute is specific to file upload functionality, and using Selenium's sendKeys method allows for the automatic selection of the desired file for this type of input field.
Syntax:
  1. WebElement fileInput = driver.findElement(By.id("idvlaue"));
  2. fileInput.sendKeys("path/to/your/file.txt");
Pros:
  • Directly interacts with the file input element.
  • Simple and concise code for file uploads.
  • No external dependencies are required.
Cons:
  • Limited to scenarios where the file input element is present.

Using Robot Class:
    The Robot class in Java can be used to simulate keyboard and mouse actions. In the context of the uploads, the Robot class can be used to navigate the file selection dialogue.

Example Program:
  1. import java.awt.AWTException;
  2. import java.awt.Robot;
  3. import java.awt.Toolkit;
  4. import java.awt.datatransfer.StringSelection;
  5. import java.awt.event.KeyEvent;
  6.  
  7. import org.openqa.selenium.By;
  8. import org.openqa.selenium.WebDriver;
  9. import org.openqa.selenium.WebElement;
  10. import org.openqa.selenium.chrome.ChromeDriver;
  11. import org.openqa.selenium.chrome.ChromeOptions;
  12.  
  13. public class Dummy2 {
  14. public static void main(String[] args) throws AWTException {
  15. ChromeOptions co = new ChromeOptions();
  16. co.addArguments("-remote-allow-origins");
  17.  
  18. WebDriver driver = new ChromeDriver();
  19.                 driver.get("https://testingcolleges.blogspot.com/2024/01/file-upload-demo-page.html");
  20.  
  21. WebElement fileUploadBtn = driver.findElement(By.className("file-label-2"));
  22. fileUploadBtn.click();
  23.  
  24. Robot rbt = new Robot();
  25. rbt.delay(2000); // just like thread.sleep
  26.  
  27. //these below 2 lines used to copy the path 
  28. StringSelection ss = new StringSelection("C:\\Users\\Dollar\\Desktop\\Dummy.txt");
  29. // copy the above path in clipboard
  30. Toolkit.getDefaultToolkit().getSystemClipboard().setContents(ss, null);
  31. //Toolkit.getDefaultToolkit().getSystemClipboard().setContents("path", null);
  32.  
  33. //perform Ctrl+V operation to pass the copied path
  34. rbt.keyPress(KeyEvent.VK_CONTROL);
  35. rbt.keyPress(KeyEvent.VK_V);
  36. rbt.keyRelease(KeyEvent.VK_CONTROL);
  37. rbt.keyRelease(KeyEvent.VK_V);
  38.  
  39. //Enter press
  40. rbt.keyPress(KeyEvent.VK_ENTER);
  41. rbt.keyRelease(KeyEvent.VK_ENTER);
  42.  
  43. }
  44. }

Pros:
  • Works across different browsers.
  • Can handle scenarios with hidden file input fields.
Cons:
  • Requires handling keyboard events, which may be error-prone.
  • Platform-dependent and may need adjustments for different operating systems.
Using AutoIT:
    AutoIT is a scripting language designed for automating the Windows GUI and general scripting tasks. It can be integrated with Selenium WebDriver to handle file Uploads.
Pros:
  • Powerful for handling complex file upload scenarios.
  • Works seamlessly across different browsers and operating systems.
Cons:
  • Requires installing AutoIT and writing an external script.
  • Add an external dependency to your automation project.


Conclusion:
    Each method for file upload with Selenium WebDriver has its advantages and limitations. The choice of method depends on the specific requirements of your application and the level of control and compatibility needed. While SendKeys is suitable for simple scenarios, the Robot class and AutoIT offer more flexibility and control, making them valuable tools for handling complex file upload functionalities.




Browser Window Sizes and Positions


maximize(): The maximize() method is used to maximize the browser window. This is often done to ensure that the web page is displayed in its full size on the screen. Here's an example of how you can use the 'maximize' method in selenium with Java:

  1. public class Maximize {
  2.     public static void main(String[] args) {
  3.         WebDriver driver = new ChromeDriver();
  4.         driver.manage().window().maximize();
  5.     }
  6. }

How to Minimize the Browser Window in Selenium?
    Selenium's Java has no built-in method to minimize the browser window. However, Selenium's Python does have a minimize function.




getSize(): The getSize() method is used to get the height and width of a WebElement/BrowserWindow. The getSize() method returns an object of type 'Dimension', and you can use its 'getHeight' and 'getWidth' methods to obtain the height and width, respectively.

Example 1: To get the height and width of a web element.
  1. import org.openqa.selenium.By;
  2. import org.openqa.selenium.Dimension;
  3. import org.openqa.selenium.WebDriver;
  4. import org.openqa.selenium.WebElement;
  5. import org.openqa.selenium.chrome.ChromeDriver;
  6. public class GetElementSize {
  7.     public static void main(String[] args) {
  8.         WebDriver driver = new ChromeDriver();
  9.         driver.get("https://example.com");
  10.         WebElement element = driver.findElement(By.id("value"));
  11.         Dimension size = element.getSize();
  12.         System.out.println("Height: " + size.getHeight());
  13.         System.out.println("Width: " + size.getWidth());
  14.     }
  15. }
Example 2: To get the height and width of a Browser Window.
  1. import org.openqa.selenium.Dimension;
  2. import org.openqa.selenium.WebDriver;
  3. import org.openqa.selenium.chrome.ChromeDriver;
  4. public class GetBrowserWindowSize {
  5.     public static void main(String[] args) {
  6.         WebDriver driver = new ChromeDriver();
  7.         driver.get("https://example.com");
  8.         Dimension windowSize = driver.manage().window().getSize();
  9.         System.out.println("Window Height: " + windowSize.getHeight());
  10.         System.out.println("Window Width: " + windowSize.getWidth());
  11.     }
  12. }
Note: In the above example instead of creating the Dimension class object we can also directly give the methods:
driver.manage().window().getSize().getHeight();
driver.manage().window().getSize().getWidth();




Resize the Browser window:
    To resize the Browser window. 'manage().window().setSize()' is a predefined method of the Selenium 'WebDriver' class.
Syntax 1:
  1. Dimension newSize = new Dimension(800, 600); //(width, height)
  2. driver.manage().window().setSize(newSize);

Syntax 2:
  1. driver.manage().window.setSize(new Dimension(800, 600));

Example:
  1. import org.openqa.selenium.Dimension;
  2. import org.openqa.selenium.WebDriver;
  3. import org.openqa.selenium.chrome.ChromeDriver;
  4. public class ResizeWindow {
  5.     public static void main(String[] args) {
  6.         System.setProperty("webdriver.chrome.driver", "path/chromedriver.exe");
  7.         WebDriver driver = new ChromeDriver();
  8.         driver.get("http://example.com/");
  9.         Dimension newSize = new Dimension(800, 600);
  10.         driver.manage().window().setSize(newSize);
  11.     }
  12. }
Note: In Eclipse, you may have recommend two import statements for the 'Dimension' class: 'import java.awt.Dimension;' and 'import org.openqa.selenium.Dimension;'. You should only select 'import org.openqa.selenium.Dimension;'. 




getPosition():
    getPosition() method gets the position of the current window, relative to the upper left corner of the screen. The method takes no arguments and returns a Point object, which contains the X and Y coordinates of the window.

Syntax:(With Point object)
  1. // Get the position of the current window
  2. Point p = driver.manage().window().getPosition();
  3. // Print the X and Y coordinates of the window
  4. System.out.println("X Coordinate: " + p.getX());
  5. System.out.println("Y Coordinate: " + p.getY());
Syntax: (We can also print the X and Y Coordinates without creating a Point object)
  1. System.out.println("X Coordinate: "driver.manage().window().getPosition().getX());
  2. System.out.println("Y Coordinate: " + driver.manage().window().getPosition().getY());

Example:
  1. import org.openqa.selenium.WebDriver;
  2. import org.openqa.selenium.chrome.ChromeDriver;
  3. public class WindowPosition {
  4.     public static void main(String[] args) {
  5.         System.setProperty("webdriver.chrome.driver", "path/chromedriver.exe");
  6.         WebDriver driver = new ChromeDriver();
  7.         driver.get("http://testingcolleges.blogspot.com/");
  8.         System.out.println("X Coordinate: " + driver.manage().window().getPosition().getX());
  9.         System.out.println("Y Coordinate: " + driver.manage().window().getPosition().getY());
  10.     }
  11. }
Output:
X Coordinate: 10
Y Coordinate: 10




setPosition():
    The setPosition() method in Selenium Webdriver sets the position of the current window relative to the top left corner of the screen. The Point class instance specifies the target position of the window.

Syntax 1:
  1. Point p = new Point(250, 250);
  2. driver.manage().window().setPosition(p);
Syntax 2:
  1. driver.manage().window().setPosition(new Point(250, 250));

Example:
  1. import org.openqa.selenium.WebDriver;
  2. import org.openqa.selenium.chrome.ChromeDriver;
  3. import org.openqa.selenium.Point;
  4. public class Position {
  5.     public static void main(String[] args) {
  6.         System.setProperty("webdriver.chrome.driver", "path/chromedriver.exe");
  7.         WebDriver driver = new ChromeDriver();
  8.         driver.get("http://testingcolleges.blogspot.com/");
  9.         Point p = new Point(250,250);
  10.         driver.manage().window().setPosition(p);
  11.     }
  12. }




****

Handling Alerts with Selenium


Handling Alerts:
    In Selenium, alerts are popup dialog boxes that appear on a web page to provide information, take user input or request confirmation of specific actions. Selenium provides methods to interact with these alerts, allowing you to automate actions such as accepting, dismissing, or entering text into them.
Here's how you can work with alerts in Selenium:

switchTo().alert():
    To interact with an alert, you first need to switch the WebDirver's focus to it using the 'switchTo().alert()' method, and then we need to perform the action.
  1. Accept an Alert: Click OK an alert and close it
    • driver.switchTo().alert().accept();
  2. Dismiss an Alert: Click Cancel or the equivalent of an alert and close it.
    • driver.switchTo().alert().dismiss();
  3. Enter Text into a Prompt Alert: If the alert is prompt that requires you to enter text, you can use the 'sendKeys()' method to provide input.
    • driver.switchTo().alert().sendKeys("Your Input text");
  4. Get Text from an Alert: To retrieve the text displayed in an alert.
    • driver.switchTo().alert().getText();

Example Program:
  1. import org.openqa.selenium.By;
  2. import org.openqa.selenium.WebDriver;
  3. import org.openqa.selenium.WebElement;
  4. import org.openqa.selenium.chrome.ChromeDriver;

  5. public class Alert2 {
  6.   public static void main(String[] args) throws InterruptedException {
  7.     WebDriver driver = new ChromeDriver();
  8.     driver.get("https://testingcolleges.blogspot.com/2023/08/alerts-demo-pages.html");
  9.     WebElement confirmbox = driver.findElement(By.id("confirmbox"));
  10.     confirmbox.click();
  11.     driver.switchTo().alert().accept();
  12.     confirmbox.click();
  13.     driver.switchTo().alert().dismiss();
  14.     WebElement promptbox = driver.findElement(By.id("promptbox"));
  15.     promptbox.click();
  16.     driver.switchTo().alert().sendKeys("Happy");
  17.     driver.switchTo().alert().accept();
  18.   }
  19. }



 
switchTo().activeElement()
    To automate active element current webpage.
For example, when we open some websites, by default one element will be selected to select such type of active element we will use this method.

Example: driver.switchTo().activeElement().sendKeys(“Hello”);





Web Elements Locators

How to Inspect the web elements of a web page?

  • Every web page has different web elements.
    • To identify the web element, use the developer tool to inspect the element.
    • For that, Right-Click on the Web Page ➡ Select Inspect ➡ Select the Element tab. (or Shortcut Key F12)
  • Every web element contains tags.
  • Those tags contain attributes.
  • Attributes contain values.
  • Example: <span class="DPvwYc GHpiyd" aria-hidden="true">Hello</span>
    Note: Web elements are common for all browsers.



What is findElement?
    It is the method used to locate elements on the webpage.
What is By?
    By is the mechanism used to locate elements within documents with the help of locator value.
What is a Locator?
    It is an address that identifies a web element uniquely within the web page.




Locators:
    There are eight types of locators.

  1. id
  2. name
  3. xpath
  4. cssSelector
  5. className
  6. tagName
  7. linkText
  8. partialLinkText

id Locator:
  • We can identify elements by using the ‘id’ attribute.
    • Syntax: <object>.findElement(By.id("id_value"));
    • Example: driver.findElement(By.id("dollar"));

  •     Example Program:
  1. import org.openqa.selenium.By;
  2. import org.openqa.selenium.WebDriver;
  3. import org.openqa.selenium.chrome.ChromeDriver;
  4. public class LocatorID {
  5.    public static void main(String[] args) {
  6.       WebDriver driver = new ChromeDriver();
  7.       driver.get("https://testingcolleges.blogspot.com/");
  8.       driver.findElement(By.id("dollar")).click();
  9.    }
  10. }



name Locator:
  • Name attribute can also identify an element.
    • Example: driver.findElement(By.name(“value”));



       xpath:

Xpath is an XML path used for navigation through the HTML structure of the page.

Right-click on the tag of the particular field (on - DOM) ➡Select copy ➡ Select copy x path

 
There are two types of XPath
1.    Absolute path:
a.    Full path from top to bottom /end.
b.    It starts with a single slash (/).
c.     Search from the root node
Example: /html/body/div/input/div[2]/ul/li[2]/……..
2.    Relative XPath:
a.    Middle of the path.
b.    It starts with a double slash (//).
c.     Search starts from the middle of the DOM
Example: //*[@id=’value’]

          Example: driver.findElement(By.xpath(“//input[@id=’value’]”))


  • Example program:
  1. import org.openqa.selenium.By;
  2. import org.openqa.selenium.WebDriver;
  3. import org.openqa.selenium.WebElement;
  4. import org.openqa.selenium.chrome.ChromeDriver;
  5. public class LocatorXpath {
  6.    public static void main(String[] args) {
  7.       WebDriver driver = new ChromeDriver();
  8.       driver.get("https://testingcolleges.blogspot.com/");
  9.       WebElement elm = driver.findElement(By.xpath("//*[@id=\'dollar\']/a/span/b/span"));
  10.       elm.click();
  11.    }
  12. }


cssSelector:

CSS is used to create style rules for web pages and can be used to identify any web element.

Right-Click on a particular element tag à Select Copy à Select Copy Selector


  1. It starts with #.
  2. Xpath does not work in the Internet Explorer browser in that scenario they will use cssSelector.
  3. XPath and cssSelectors are both the same.
  • Example: driver.findElement(By.cssSelector(“#input[@attribute]”))



className:
    A ClassName operator uses a class attribute to identify an object.
        Example: driver.findElement(By.className(‘value’))



tagName:
    In real-time we rarely use it. Here we will assign the values based on the main tag, not an attribute.
  • Example: driver.findElement(By.tagName(“INPUT”));
Note: The tag name can be declared using either capital or lowercase letters. 

  • Example Program: Get the number of links on a web page using Selenium with Java?
  1. import java.util.List;
  2. import org.openqa.selenium.By;
  3. import org.openqa.selenium.WebDriver;
  4. import org.openqa.selenium.WebElement;
  5. import org.openqa.selenium.chrome.ChromeDriver;

  6. public class Links {
  7. public static void main(String[] args) {
  8. WebDriver driver = new ChromeDriver();
  9. driver.get("https://www.google.com");
  10. List<WebElement> l = driver.findElements(By.tagName("a"));
  11. System.out.println(l.size());
  12. driver.quit();
  13. }
  14. }
  15. //Output: 25




linkText:
    Text (visible text) used in hyperlinks can also locate elements.
        Example: driver.findElement(By.linkText(“Gmail”));
  • Example program:
  1. import org.openqa.selenium.By;
  2. import org.openqa.selenium.WebDriver;
  3. import org.openqa.selenium.WebElement;
  4. import org.openqa.selenium.chrome.ChromeDriver;
  5. public class LocatoeLinkText {
  6.    public static void main(String[] args) {
  7.       WebDriver driver = new ChromeDriver();
  8.       driver.get("https://testingcolleges.blogspot.com/");
  9.       WebElement elm = driver.findElement(By.linkText("C# vs Java"));
  10.       elm.click();
  11.    }
  12. }



partialLinkText:
    Part of the text in the link can also identify an element.
        Example: link text:- click here to download this song
            driver.findElement(By.partialLinkText(“click here”));



- Next Page: Navigation Commands